AD-A274  388 


AFIT/GE/ENG/93D-04 


THREE  DIMENSIONAL  OBJECT  RECOGNITION 
USING  A  COMPLEX  AUTOREGRESSIVE  MODEL 


THESIS 

David  Eugene  Chelen 
Captain,  USAF 

AFIT/GE/ENG/93D-04 


Approved  for  public  release;  distribution  unlimited 


93 


12  22  178 


AFIT/GE/ENG/93D-04 


THREE  DIMENSIONAL  OBJECT  RECOGNITION 
USING  A  COMPLEX  AUTOREGRESSIVE  MODEL 


THESIS 


Presented  to  the  Faculty  of  the  Graduate  School  of  Engineering 
of  the  Air  Force  Institute  of  Technology 
Air  University 


In  Partial  Fulfillment  of  the 


Requirements  lor  the  Degree  of 

Accasion  ^ 

Master  of  Science 

NTi3  CRA4I  df 

OTIC  TAB  ^ 

Unannounead  □ 

Justification 

ay 

David  Eugene  Chelen,  B.S. 

Oisihibirtion; 

Captain,  USAF 

AvaitaMHy  Codaa 

DM 

Avon  and/or 

S^ial 

1 

ti 

_J _ 

Approved  for  public  release;  distribution  unlimited 


Acknowledgements 


My  Ruckstrations  at  AFIT  have  finally  come  to  an  end!  Before  signing  off,  though,  I  would 
like  to  th2uik  all  of  the  folks  who  pulled  me  (kicking  and  complaining)  through  these  hallowed  halls 
of  higher  education.  Thanks  to  my  advisor.  Captain  Dennis  (Doc)  Ruck,  I  was  able  to  conduct  my 
autoregressive  Pattern  Ruck  research  with  more  unrestricted  freedom  than  that  of  most  students. 
For  keeping  me  in  the  right  church  (just  not  always  the  right  pew),  I  thank  committee  member, 
Dr  Steven  Rogers.  I  thank  Dr  Mark  Oxley,  committee  member  #2,  for  his  attempt  to  translate 
my  mathematical  hieroglyphics  into  comprehendable  expressions.  I  also  give  special  thanks  to  my 
adopted  pseudo-committee  member  <ind  all  around  good  guy.  Captain  Ken  Fieiding,  for  his  patience 
and  much  appreciated  efforts  at  trying  to  make  that  darn  HMM  work!  Oh,  well  -  at  least  we  had 
some  really  cool  imagery! 

On  the  flip  side,  I  need  to  thank  Capt  Kurt  Knurr,  Lt  Tim  Pennington,  and  Lt  Curtis  Martin, 
for  showing  me  that  computers  really  can  be  reasonable  (if  you  push  all  the  right  buttons).  To  the 
Night-Shift  crew,  we’re  stupid,  but  we’re  gragiatin’!  And  to  my  use  or  lose  family,  a  special  thanks 
for  your  unconditional  love  and  support. 

Finally,  I  thank  my  Lord  and  Savior,  Jesus  Christ,  who  has  held  my  hand  (if  not  carried  me) 
through  more  challenges  than  anyone. 


David  Eugene  Chelen 


Table  of  Contents 


Page 

Acknowledgements .  ii 

List  of  Figures .  vi 

List  of  Tables .  ix 

Abstract .  x 

I.  Introduction .  1-1 

1.1  Problem .  1-1 

1.2  Background .  1-2 

1.3  A«:.umptions .  1-2 

1.4  Scope .  1-3 

1.5  Approach .  1-3 

1.6  Overview  of  Thesis .  1-4 

II.  Literature  Review  .  2-1 

2.1  Introduction .  2-1 

2.2  The  Autoregressive  Model .  2-1 

2.3  Recent  Studies .  2-2 

2.3.1  Bivariate  Autoregressive  Model .  2-2 

2.3.2  Complex  Autoregressive  Model .  2-4 

2.3.3  Autoregressive  Model  Classification  by  Hidden  Markov  Model  2-5 

2.4  Chapter  Summary .  2-6 

III.  Methodology  .  3-1 

3.1  Introduction .  3-1 

3.2  CPARCOR  Algorithm  Development .  3-1 

iii 


Page 

3.3  CPARCOR  Algorithm  Software  Validation .  3-3 

3.3.1  Generation  of  Test  Images .  3-4 

3.3.2  Load  and  Enhance  Image  Subroutine .  3-5 

3.3.3  TVace  Subroutine .  3-6 

3.3.4  Sampling  Subroutine .  3-8 

3.3.5  Coefficient  Stability  Test  .  3-8 

3.3.6  Algorithm  Verihcation  Test .  3-11 

3.3.7  Section  Summary .  3-12 

3.4  Classiher  Application .  3-12 

3.4.1  Modifications .  3-14 

3.4.2  Template  Matching  Algorithm .  3-16 

3.4.3  K-Nearest-Neighbor  (KNN)  Techniques .  3-25 

3.4.4  Hidden  Markov  Model  (HMM)  Testing .  3-25 

3.5  Chapter  Summary .  3-29 

IV.  Results  and  Discussion .  4-1 

4.1  Template  Test  Results  . .  4-1 

4.1.1  Matrix  Reduction  and  Unstored  View  Testing .  4-2 

4.1.2  Occlusion  Testing  .  4-6 

4.1.3  Discussion  of  Template  Test  Results .  4-12 

4.2  K-Nearest-Neighbor  (KNN)  Test  Results  .  4-13 

4.2.1  Single-Look  1-NN  Testing .  4-14 

4.2.2  Multiple-Look  1-NN  Testing .  4-15 

4.2.3  Discussion  of  KNN  Test  Results  .  4-15 

4.3  Hidden  Markov  Model  (HMM)  Test  Results .  4-16 

4.4  Chapter  Summary .  4-21 


IV 


Page 

V.  Conclusions  and  Recommendations .  5-1 

5.1  Conclusions .  5-1 

5.1.1  Template  Matching  Algorithm  Results .  5-2 

5.1.2  K-Nearest-Neighbor  (KNN)  Technique  Results .  5-2 

5.1.3  Hidden  Markov  Model  (HMM)  Test  Results .  5-3 

5.2  Recommendations .  5-3 

5.3  Future  Research .  5-3 

Appendix  A.  Supplemental  Test  Results .  A-1 

Appendix  B.  White  Noise  Test  Source  Code .  B-1 

B.l  Introduction .  B-1 

B. 2  MATLAB  Test  Routines  and  Data  Sets .  B-1 

Appendix  C.  CPARCOR  Algorithm  Source  Code .  C-1 

C. l  Introduction .  C-1 

C. 2  CPARCOR  Algorithm .  C-1 

Appendix  D.  Template  Matching  Algorithm .  D-1 

D. l  Introduction .  D-1 

D.2  Matrix  Source  Code .  D-2 

D.3  Template  Matching  Algorithm  Source  Code .  D-7 

Bibliography .  BIB-1 

Vita .  VITA-1 


V 


List  of  Figures 


Figure  Page 

2.1.  Example  of  complex  coordinates  expression .  2-2 

3.1.  Initial  test  images:  (a)  Geometric  shapes  (b)  Typical  machine  parts .  3-3 

3.2.  Primary  test  images .  3-4 

3.3.  Typical  MATLAB  image  (M60  Tank) .  3-5 

3.4.  Effect  of  Mask  Subroutine  on  typical  image .  3-6 

3.5.  Effect  of  Trace  Subroutine  on  typical  image .  3-7 

3.6.  Effect  of  Sampling  Subroutine  on  typical  image .  3-8 

3.7.  Result  of  complex  white  noise  test  for  CPARCOR  algorithm .  3-11 

3.8.  CPARCOR  coefficient  PSRI  comparisons,  (a)  Feature  vectors  for  scaled  and 

rotated  square  and  machine  part  (b)  Mean  value  of  square  and  machine  feature 
vectors .  3-13 

3.9.  Characteristic  Views  in  360°  Template  (a)  Basic  template  format  (b)  Represen¬ 
tative  test  images  for  M60  Tank .  3-18 

3.10.  Characteristic  Views  for  M60  Tank .  3-20 

3.11.  Verification  of  previously  reported  occluded  view  recognition  rates.  Test  set  con¬ 
sisted  of  occluded  M60  Tank  views  only.  Recognition  rates  for  (a)  Dissimilar  3 

class  (MOO,  BTR,  M35)  (b)  Similar  3  class  (M60,  M2,  T62) .  3-23 

3.12.  CPARCOR  coefficient  occlusion  comparisons,  (a)  Feature  vectors  for  MOO  char¬ 

acteristic  views  occluded  0,  5,  10,  and  20%  (b)  Mean  value  of  occluded  feature 
vectors .  3-24 

3.13.  Example  of  MOO  occluded  views  for  (a)  5%  (b)  10%  (c)  20% .  3-26 

3.14.  Example  of  Mean  Distortion  Curves  (MDC)  for  cluster  centers  (Fourier  features 

shown) .  3-27 

3.15.  Example  of  sphere  mapping  (Fourier  features  shown) .  3-28 

3.16.  Example  of  sphere  trajectories .  3-29 


VI 


Figure  Page 

4.1.  Comparison  of  90°  and  360°  template  for  unstored  view  recognition  results  with 

3  classes .  4-4 

4.2.  Comparison  of  90°  and  360°  template  for  unstored  view  recognition  results  with 

5  classes .  4-5 

4.3.  Unstored  view  recognition  results  for  3  classes  and  various  90°  view  templates. .  4-7 

4.4.  Unstored  view  recognition  results  for  5  classes  and  various  90°  view  templates. .  4-8 

4.5.  Three  class  recognition  rates  for  occluded  M60  views.  Various  matrix  sizes  used 

are  (a)  2  x  2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix.  .  .  4-10 

4.6.  Five  class  recognition  rates  for  occluded  M60  views.  Various  matrix  sizes  used 

are  (a)  2  x  2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix.  .  .  4-11 

4.7.  Comparison  of  Mean  Distortion  Curves  (MDC)  for  cluster  centers  (a)  CPARCOR 

features  (b)  Fourier  features .  4-17 

4.8.  Comparison  of  5  class  sphere  mapping  (a)  CPARCOR  features  (b)  Fourier  features.  4-19 

4.9.  Comparison  of  5  class  (a)  Mean  and  (b)  Standard  Deviation  of  CPARCOR  feature 

vectors  (unnormalized  data) .  4-20 

A.l.  Single-order  CPARCOR  coefficient  recognition  analysis;  (a)  Recognition  results 

for  all  five  classes  (b)  Mean  value  of  recognition  results  (all  five  classes) .  A-2 

A. 2,  Unstored  view  comparison  using  360°(3  X  3)  for  (a)  3  Classes  and  (b)  5  Classes.  A-3 

A.3.  Comparison  of  recognition  results  for  3  and  5  Class,  360°(3  x  3)  template.  .  .  .  A-4 

A.4.  Comparison  of  180°(2  x  3)  and  360°(3  x  3)  template  for  unstored  view  recognition 

results  with  3  classes .  A-5 

A. 5.  Comparison  using  180°  template  for  unstored  view  recognition.  Results  shown 

for  (a)  M60  Tank  (b)  BTR  APC  (c)  M35  TVuck .  A-6 

A.6.  Comparison  using  90°  (2  x  2, 5  x  5, 10  x  10)  and  180°  (2  x  3, 5  x  9, 10  x  19)  templates 

for  unstored  view  recognition  of  the  M60 .  A-7 

A.7.  Comparison  using  90°  (2  x  2, 5  x  5, 10  x  10)  and  180°  (2x3,5x9,10x19)  templates 

for  unstored  view  recognition  of  the  BTR .  A-8 

A. 8.  Comparison  using  90°  (2  x  2, 5  x  5, 10  x  10)  and  180°  (2  x  3, 5  x  9, 10  x  19)  templates 

for  unstored  view  recognition  of  the  M35 .  A-9 

vii 


Figure  Page 

A.9.  Comparison  of  Occlusion  results  for  15  occluded  M60  characteristic  views  using 

dissimilar  3  class  set  (M60,  BTR,  M35) .  A-10 

A. 10.  Occlusion  results  for  15  occluded  MOO  characteristic  views  using  similar  Z  class 

set  (M60,  M2,  T62) .  A-11 

A. 11.  Comparison  of  90°  and  360°  templates  for  recognition  of  occluded  M60  charac¬ 
teristic  views  (a)  3  Classes  (b)  5  Classes .  A-12 

A. 12.  3  class  recognition  rates  for  10  occluded  M2  views.  Various  matrix  sizes  used  are 

(a)  2  X  2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix .  A- 13 

A.  13.  3  class  recognition  rates  for  10  occluded  M2  views.  Various  matrix  sizes  used  are 

(a)  2  X  2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix .  A-14 

A.  14.  Mean  value  of  occlusion  recognition  rates  for  various  size  templates.  M60  and 

M35  results  shown  for  3  and  5  Classes .  A-15 


viii 


List  of  Tables 


Table  Page 

3.1.  White  noise  test  coefficient  comparison .  3-10 

3.2.  Unstored  view  test  set  (Clas8_Azimuth^levation) .  3-20 

3.3.  Views  occluded  5,  10,  and  20% .  3-21 

3.4.  MOO  and  M2  views  occluded  5,  10,  and  20% .  3-22 

4.1.  Single-Look  test  results  for  CPARCOR  features .  4-14 

4.2.  Single-Look  test  results  for  Fourier  features .  4-14 

4.3.  Multiple-Look  test  results  for  CPARCOR  features .  4-14 

4.4.  Multiple- Look  test  results  for  Fourier  features .  4-15 


IX 


AFIT/GE/ENG/93D-04 


Abstract 

Based  on  an  autoregressive  model,  Complex  Partial  Correlation  (CPARCOR)  features  are 
known  to  provide  exceptional  Position,  Scale,  and  Rotation  Invariant  (PSRI)  properties  for  plwar 
2-Dimen8ional  (2-D)  object  recognition.  Although  autoregressive  models  have  been  successfully 
applied  to  numerous  spatio-temporal  recognition  tasks,  the  effects  of  out-of-plant  image  rotations 
were  never  considered.  This  study  investigates  application  of  the  CPARCOR  model  to  a  five  class 
problem  of  nonplanar  2-D  views  of  3-D  objects.  Recognition  based  on  CPARCOR  features  is 
evaluated  using  a  Template  Matching  algorithm,  two  K- Nearest- Neighbor  (KNN)  classifiers,  and 
a  Hidden  Markov  Model  (HMM).  Direct  comparisons  to  recognition  based  on  Fourier  features  are 
made.  Results  indicate  that  the  CPARCOR  model  parameters  provide  useful  shape-features  for 
recognition  of  out-of-plane  rotations.  Displaying  exceptional  PSRI  properties,  the  features  are 
shown  capable  of  classification  by  simple  nonadaptive  recognition  schemes.  Relatively  successful 
results  are  obtained  for  a  variety  of  tests.  The  advantage  of  classification  by  a  multiple-look  tech¬ 
nique  over  the  traditional  single-look  method  is  clearly  demonstrated.  Feature  space  crowding  is 
noted  as  the  cause  of  unusual  recognition  rates  for  occluded-view  tests.  Although  general  trends 
are  noted,  optimal  model  order  and  selection  of  CPARCOR  versus  Fourier  features  are  considered 
application  dependent. 
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THREE  DIMENSIONAL  OBJECT  RECOGNITION 
USING  A  COMPLEX  AUTOREGRESSIVE  MODEL 

/.  Introduction 

Automatic  target  recognition  capabilities  are  obviously  very  important  to  the  Air  Force  mis¬ 
sion.  The  majority  of  automatic  target  recognition  systems  currently  undergoing  Air  Force  research 
are  based  on  performing  a  given  processing  technique  on  a  single  frame  of  sensor  (visual  or  infrared) 
imagery.  Determination  of  desired  information  (such  as  target  classification)  is  made,  and  then  a 
new  frame  of  imagery  is  analyzed  and  the  process  repeated  [6].  Current  methods  do  not  account  for 
information  contained  in  the  spatio-temporal  changes  an  object  undergoes  as  it  moves  relative  to 
an  observer  (or  vice  versa).  This  information  may  be  a  useful  aid  in  the  interpretation  of  a  target’s 
motion  and  recognition  [14] . 

Based  on  a  complex  autoregressive  model,  Complex  Partial  Correlation  (CPARCOR)  fea¬ 
tures  are  known  to  provide  exceptional  Position,  Scale,  and  Rotation  Invariant  (PSRI)  properties 
for  application  to  planar  2-Dimensional  (2-D)  object  recognition [17].  Although  previous  research 
hcis  successfully  applied  autoregressive  models  to  spatio-temporal  recognition  tasks[2][3][9][17],  the 
effects  of  otti-of-plane  image  rotations  were  never  considered. 

J.J  Problem 

The  goal  of  this  thesis  is  to  demonstrate  application  of  the  CPARCOR  algorithm  to  spatio- 
temporal  recognition  of  3-D  objects.  Classification  based  on  spatio-temporal  information  is  ex¬ 
pected  to  yield  superior  results  compared  to  single-frame  techniques  [7].  For  this  research,  the 
CPARCOR  method  of  feature  extraction  will  be  applied  to  nonplanar  2-D  views  of  3-D  objects. 
Recognition  based  on  CPARCOR  features  will  be  evaluated  using  the  following  classification  tech¬ 
niques: 


1-1 


•  Template  Matching  Algorithm:  Using  a  Euclidean  distance  metric,  single-frame  recognition 
performance  will  be  evaluated  using  test  sets  of  unstored  and  occluded  characteristic  views.  A 
chMacteristic  view  is  basically  an  object  orientation  particular  to  a  given  class,  while  unstored 
views  are  simply  characteristic  views  that  are  not  included  (or  stored)  in  the  respective 
template. 

•  K~ Nearest- Neighbor  (KNN)  Techniques:  Recognition  of  both  single  and  multiple  frames  of 
imagery  will  be  performed  using  the  hold-one-out  technique  and  Single/Multiple-Look  1-NN 
classifiers.  A  direct  comparison  with  recognition  by  Fourier  features  will  be  made. 

•  Hidden  Markov  Model  (HMM):  Temporal  sequence  recognition  by  CPARCOR  features  will 
be  attempted  for  a  direct  (apples  to  apples)  comparison  with  recognition  by  Fourier  features. 

1.2  Background 

Although  a  variety  of  shape  descriptors  have  been  developed,  few  have  been  able  to  match  the 
PSRI  properties  of  the  Complex  PARCOR  features.  Based  on  an  extension  of  the  real  PARCOR 
coefficients  (used  for  speech  signal  processing),  the  CPARCOR  features  are  calculated  from  sampled 
boundaries  of  complicated,  non-convex  2-D  objects.  Basic  recognition  cein  then  be  accomplished 
with  a  Euclidean  metric  to  measure  distance  between  the  coefficients.  Exceptional  recognition  of 
planar  2-D  objects  corrupted  by  both  noise  and  partial  occlusion  has  been  reported  [17]. 

1.3  Assumptions 

Difficulties  encountered  with  object  recognition  systems  are  best  summarized  in  the  following 
statement,  “Segmenting  a  3-D  object  cleanly  from  a  complex  scene  is  a  very  difficult  problem  in 
general  because  of  interference  from  noise,  occluding  objects,  background,  illumination,  and  spatial 
sampling  effects[16;108].”  To  avoid  these  problems,  the  following  assumptions  for  image  generation 
will  be  made  at  the  outset  of  this  thesis: 
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•  Complete  segmentation  of  the  object  from  a  complex  scene 

•  Recognition  of  only  a  single  object  per  frame/sequence 

•  No  undesired  image  corruption  due  to  noise  interference 

1-4  Scope 

The  scope  of  this  thesis  will  focus  on  an  evaluation  of  CPARCOR  features  through  application 
of  sever2d  known  classification  techniques.  Using  five  classes,  imagery  will  consists  of  nonplanar  2-D 
representations  of  complicated  3-D  images.  Particular  emphasis  shall  be  devoted  to  the  areas  of 
unstored/occluded  (Template  Matching),  single/multiple-look  (KNN),  and  temporal  (HMM)  image 
recognition. 

J.5  Approach 

Object  recognition  based  on  CPARCOR  features  will  be  accomplished  in  the  following  five 

steps: 

1.  The  CPARCOR  Algorithm  will  be  created  using  the  MATLAB  programming  environment. 
Stability  of  the  CPARCOR  coefficients  will  be  verified  through  application  of  a  complex  (gaus- 
sian)  white  noise  sequence  to  the  model’s  transfer  function.  Previously  reported  classification 
results  (based  on  CPARCOR  features)  will  then  be  verified  by  using  simple  shapes  (squares, 
circles,  triangles,  etc.)  and  a  1-Nearest  Neighbor  (1-NN)  recognition  routine. 

2.  For  primary  recognition  testing,  highly  detailed  test  images  will  be  generated  from  the  BRL- 
CAD  software  [1].  Test  images  include  the  Army’s  M60  Tank,  M35  TVuck,  BTR60  Armored 
Personnel  Carrier(APC),  T62  Tank,  and  M2  Infantry  Fighting  Vehicle  (IFV).  All  test  images 
will  be  128  x  128  ASCII  arrays,  consisting  of  views  taken  every  five  degrees  from  0°  to  355°  in 
azimuth  and  0°  to  90°  in  elevation.  Recognition  performance  based  on  CPARCOR  features 
will  be  evaluated  using  several  classification  techniques  (Items  3,4,5). 
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3.  Classification  by  a  Template  Matching  Algorithm  (Euclidean  distance  metric)  will  examine 
the  impact  of  unstored  and  partially  occluded  views  on  single-frame  recognition  performance. 
Using  unnormalized  CPARCOR  features,  various  template  sets  will  be  made  for  object  views 
of  360°,  180°,  and  90°  in  azimuth.  Individual  templates  will  also  provide  a  viewing  range 
from  0°  to  90°  in  elevation.  Using  both  three  and  five  class  problems,  the  matrix  size  of  each 
template  will  vary  in  order  to  examine  the  effective  recognition  rates. 

4.  Two  versions  of  a  1-NN  classifier  will  be  used  to  compare  the  effects  of  single-look  versus 
multiple-look  recognition  on  unnormalized  CPARCOR  features.  Using  a  five  class  problem, 
recognition  of  both  single-frame  and  multiple-frame  sequences  of  uncorrupied  (no  noise/occlusion) 
imagery  will  be  attempted.  A  direct  comparison  with  recognition  by  Fourier  features  will  be 
made. 

5.  A  Hidden  Markov  Model  (HMM)  will  be  applied  to  determine  if  CPARCOR  features  can 
accurately  represent  temporally  encoded  sequences  of  images.  The  HMM  classifier  is  known 
to  display  exceptional  recognition  rates  (over  97%)  for  temporal  sequences  represented  by  a 
low  frequency,  Fourier  magnitude  feature  set  [7].  Thus,  a  direct  {apples  to  apples)  five  class 
performance  comparison  will  be  attempted  by  simply  replacing  the  Fourier  features  with  a 
set  of  CPARCOR  features. 

1.6  Overview  of  Thesis 

This  thesis  consists  of  the  following  chapters: 

Chapter  //provides  a  literature  review  of  the  most  recent  studies  using  autoregressive  features 
for  image  recognition.  In  particular,  the  CPARCOR  algorithm  is  noted  for  its  exceptionally  high 
accuracy  when  applied  to  recognition  of  planar  2-D  objects  corrupted  by  both  noise  and  partial 
occlusion. 


1-4 


Chapter  III  covers  the  development  and  testing  of  the  CPARCOR  algorithm.  Along  with  the 
fundamental  principles  of  the  algorithm,  the  methodology  used  to  verify  both  the  main  program  and 
supporting  subroutines  is  described.  The  chapter  concludes  with  a  description  of  several  recognition 
tests  conducted  using  CPARCOR  features  and  various  classifiers. 

Chapter  IV  provides  a  discussion  of  the  results  for  tests  described  in  Chapter  III.  Topics  are 
presented  in  the  same  order  as  they  appear  in  Chapter  III.  For  clarity,  additional  test  results  were 
included  in  Appendix  A. 

Chapter  V  provides  conclusions  and  several  recommendations  for  future  research  based  on 
the  results  discussed  in  Chapter  IV. 


1-5 


//.  Literature  Review 


S.  1  Introduction 

A  subset  of  the  most  recent  studies  using  autoregressive  features  for  image  recofuition  is 
presented.  For  this  research,  image  recognition  will  consist  of  the  classification  of  objects  (such  as 
tanks,  trucks,  etc.)  represented  by  Complex  Partial  Correlation  (CPARCOR)  features. 

S.2  The  Autoregressive  Model 

The  ability  to  accurately  classify  objects  in  a  scene  is  of  great  importance  with  both  military 
and  civilian  application[2].  Related  to  the  salient  features  used  by  humans  for  scene  analysis,  clas¬ 
sification  by  features  based  on  the  boundaries  (or  edges)  of  shapes  has  shown  promising  results[5]. 
Although  a  variety  of  shape  descriptors  exist,  three  general  categories  include:  scalar  transform, 
space  domain,  and  curve  fitting  techniques[2].  Considered  a  method  of  scalar  transform,  the  As* 
toRegressive  (AR)  model  is  noted  to  perform  exceptionally  well  when  applied  to  the  following  planar 
2-D  object  recognition  tests: 

•  Classification  of  scaled,  rotated,  and  translated  2-D  planar  images[2]. 

•  Classification  of  complicated,  partially  occluded  image8[3]. 

•  Temporal  sequence  classification  of  simple,  rotated,  and  occluded  2-D  planar  images  by  Hid¬ 
den  Maikov  Model  (HMM)[9]. 

Receiving  widespread  application  in  speech  recognition,  an  autoregressive  model  of  order  m 
is  defined  as  a  parametric  technique  that  expresses  data  samples  (from  an  ordered  set)  as  a  linear 
combination  of  the  preceding  m  samples  from  the  set  plus  an  error  term  (white  noise).  For  object 
recognition,  model  parameters  are  estimated  from  a  set  of  the  object’s  boundary  samples.  Functions 
of  the  model  parameters  can  be  made  invariuit  to  an  object’s  position,  scue,  and  rotation  by 
applying  appropriate  bor.Tidaiy  sampling  technique8[3][17].  Although  a  variety  of  methods  (equal 
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Zi  =  X|  ♦  lYi 

Figure  2.1  Example  of  complex  coordinates  expression. 

angle,  equal  curve  length,  etc.)  exists.  Figure  2.1[17]  shows  the  boundary  representation  used  by 
the  CAR/CPARCOR  model  (see  Section  2.3.2).  In  this  case,  each  boundary  point  is  represented 
by  a  complex  number  of  form  zj  =  xj  +  iyj.  A  sequence  of  complex  numbers,  Zj,  is  obtained. 
From  this  sequence,  an  autoregressive  model  of  order  m  is  defined  as  shown  below: 


m 

=  ^OkZj-k  +  j  €  Z  (2.1) 

k=l 

where  Z  is  the  set  of  integers,  are  the  CAR  coefficients,  and  €j  represents  an  error  term. 

2.3  R^rert  Studies 

The  following  three  sections  provide  highlights  of  the  most  recent  studies  using  autoregressive 
features  for  image  recognition.  Focusing  on  the  methodology  applied  to  each  test,  this  literature  re¬ 
view  will  provide  a  foundation  for  further  research  involving  the  autoregressive  model,  CPARCOR, 
for  3-D  object  recognition. 

2.3.1  Bivarieie  Auioreg.-essive  Model.  Monohar  Das  and  others  present  a  bivariate  autore¬ 
gressive  model  for  classification  of  closed  planar  8bapes[2].  Basically  an  extension  of  1-D  Complex 
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AutoRegressive  (CAR)  models,  a  scalar  transform  method  is  explored  and  shown  to  have  certain 
advantages  over  the  1-D  methods.  Shown  to  be  invariant  to  rotation,  translation,  and  scaling,  the 
bivariate  model  can  be  applied  to  both  convex  and  nonconvex  objects  without  loss  of  phase  infor¬ 
mation.  Depending  on  the  data  set,  the  model  allows  for  flexibility  in  choice  of  sampling  method, 
which  can  be  used  to  bias  classification  results.  Finally,  the  use  of  an  z,  y  coordinate  system  results 
in  lower  residual  errors  than  those  achieved  with  1-D  models. 

Three  data  sets  (nonoverlapping  simple  planar  shapes)  were  used  to  evaluate  model  sensitivity 
to  variations  in  boundary  shape,  larger  number  of  classes,  and  partial  boundary  occlusions,  respec¬ 
tively.  Several  shapes  were  taken  from  a  set  of  images  previously  applied  to  1-D  CAR  te8ting[3] . 
Using  35  images  per  class,  the  shapes  from  set  one  were  arbitrarily  rotated,  shifted,  and  scaled 
within  the  image  plane.  For  set  two,  50  images  per  class  were  selected  in  a  similar  fashion.  Set 
three,  a  subset  of  set  two  images,  consisted  of  10  different  partial  occlusions  for  each  of  four  shapes 
(40  total  images).  Occlusions  were  made  by  removing  up  to  60%  of  the  image.  Thresholded  to 
produce  binary  images,  shapes  were  traced  by  a  boundary  follower  algorithm.  A  polygonal  ap¬ 
proximation  was  calculated  for  each  shape  in  order  to  reduce  feature  vector  variance.  Using  an 
equal-curve-length  technique,  64  boundary  samples  were  taken  for  each  shape.  Classification  was 
based  on  the  feature  weighting  (FW)  and  rotated  coordinate  system  (RCS)  methods.  Model  order 
was  limited  to  four  due  to  100%  classificatio.  accuracy  achieved  with  a  second  order  model  (for 
nonoccluded  shapes). 

Results  indicated  that  the  bivariate  CAR  modeling  technique  provided  greater  overall  clas¬ 
sification  accuracy  than  the  1-D  models.  Although  occlusions  were  noted  to  degrade  classification 
performance,  the  bivariate  technique  was  shown  to  be  less  computationally  intense  than  other 
methods.  The  bivariate  model  was  also  considered  ideal  for  application  to  automated  inspection 
where  occlusions  were  not  likely. 
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2.3.S  Complex  Auiongrtasive  Model.  Few  shape  descriptors  have  been  able  to  match  the 
Position,  Scale,  and  Rotation  Invariant  (PSRI)  properties  of  the  Complex  PARCOR  coefficients. 
Based  on  an  extension  of  the  real  PARCOR  coefficients  (used  for  speech  signal  processing),  the 
CPARCOR  coefficients  are  calculated  from  sampled  bounduies  of  complicated,  non-convex  2-D 
objects.  Basic  recognition  can  then  be  accomplished  using  a  Euclidean  metric  to  measure  distance 
between  the  coefficients.  Exceptional  recognition  of  planar,  2-D  objects  corrupted  by  both  noise 
and  partial  occlusion  has  been  reported  [17]. 

Sekita  and  others  present  a  fast  algorithm  which  generates  both  2-D  CAR  and  CPARCOR 
coefficients  of  order  m.  Comparing  recognition  performance  of  the  CAR/CPARCOR  model  to 
various  other  methods  (1-D  AR,  moment  invariants,  Fourier  descriptors),  CPARCOR  coefficients 
are  shown  to  display  superior  performance.  Of  the  two  data  sets  used,  set  one  consisted  of  a  four 
class  set  of  typical  machine  parts,  while  set  two  contained  a  five  class  set  of  80  different  types  of 
plant  leaves.  Test  images  were  subject  to  position,  scale,  and  rotational  modifications,  as  well  as 
distortions  from  partial  occlusions  and  noise.  Discriminant  analysis  was  applied  to  absorb  intraclass 
variations,  and  the  Bayesian  decision  was  used. 

PSRI  testing  used  96  images  from  set  one  (4  classes  x  3  sizes  x  8  rotations)  and  the  Rotated 
Coordinate  System  (RCS)  method.  A  direct  comparison  of  model  orders  1  through  10  revealed 
that  CAR  and  CPARCOR  features  were  good  for  recognition  of  nonconvex,  complicated,  planar 
shapes.  Additioned  testing,  however,  indicated  that  CAR  coefficients  were  negatively  influenced  by 
the  number  of  representative  boundary  points,  while  the  CPARCOR  features  remained  consistent. 
Testing  of  set  two  revealed  similar  results.  Researchers  concluded[17]  that  models  of  low  order 
contain  enough  information  about  the  shapes,  while  higher  order  models  were  more  sensitive  to 
boundary  disturbances. 

Occlusion  testing,  performed  on  both  shape  sets,  involved  images  randomly  occluded  from  5 
to  10%.  Although  CAR  and  CPARCOR  coefficients  performed  equally  well  on  set  one  (machine 
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parts),  both  CPARCOR  and  Fourier  descriptors  outperformed  the  CAR  coefficients  on  set  two  data 
(plant  leaves).  Also,  the  negative  impact  of  the  number  of  representative  boundary  points  on  the 
CAR  coefficients  was  again  noted.  Thus,  the  CPARCOR  coefficients  were  generally  found  to  be 
more  stable  in  higher  order  models  than  the  CAR  features. 

2.S.S  Auioregresaive  Model  Classification  by  Hidden  Markov  Model.  He  and  Kundu[9] 
present  a  method  that  combines  the  Hidden  Markov  Model  (HMM)  with  autoregressive  parameters 
for  recognition  of  2-D  planar  shapes.  Closed  (unbroken  boundaury)  shapes  are  segmented  in  order 
to  examine  the  characteristic  relations  between  consecutive  segments  for  purposes  of  classification 
at  a  finer  level.  Tolerant  of  moderate  amounts  of  contour  perturbation  and  occlusion,  the  proposed 
classifier  is  also  insensitive  to  orientation  of  the  planar  shapes. 

The  authors’[9]  cite  the  autoregressive  model’s  primary  disadvantage  as  being  the  use  of  only 
one  set  of  predictive  parameters  to  model  an  entire  shape.  The  HMM,  on  the  other  hand,  is  noted 
to  explore  the  relationship  between  consecutive  segments  of  a  shape’s  boundary.  With  the  HMM, 
overall  better  results  were  achieved  compared  to  methods  that  represent  the  shape  with  only  one 
set  of  features.  The  entire  process  begins  by  segmenting  the  1-D  representation  of  a  closed  shape 
into  several  pieces.  AR  modeling  is  used  to  characterize  each  piece,  resulting  in  a  vector  sequence 
for  each  shape.  The  HMM  is  then  applied  to  classify  the  final  vector  sequence. 

Two  sets  of  data  were  used  to  test  the  algorithm.  Set  one  consisted  of  an  eight  class  problem 
with  30  images  per  class.  Of  the  images,  24  were  individually  drawn  with  moderate  boundary 
perturbations,  while  6  were  partially  occluded.  Testing  on  this  set  revealed  that  most  initial  cluster 
centers  gave  very  similar  results.  Recognition  rates  of  100%  were  achieved,  though,  and  the  authors’ 
note  that  the  correct  recognition  rate  has  a  tendency  to  increase  as  the  number  of  HMM  states  was 
increased.  Recognition  also  increased  with  higher  AR  model  order,  where  models  of  orders  4  and 
5  gave  the  best  results. 
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Set  two  consisted  of  four  country  maps  (Britain,  China,  Italy,  USA).  30  images  were  created 
for  each  class,  six  of  which  were  partially  occluded.  Subject  to  test  procedures  identical  to  those 
of  set  one,  the  same  overall  conclusions  were  obtained  for  set  two.  In  general,  the  HMM  was  found 
capable  of  handling  2-D  shapes  of  higher  complexity  by  increasing  the  number  of  model  states. 

2.4  Chapter  Summary 

This  literature  review  focused  on  a  subset  of  the  most  recent  studies  using  autoregressive 
features  for  image  recognition.  Expanding  previous  test  methodologies  to  include  nonplanar  views, 
this  research  will  be  based  on  results  of  the  articles  presented.  Displaying  exceptional  performance 
and  PSRI  properties,  the  CPARCOR  model  was  chosen  for  application  to  the  task  of  3-D  object 
recognition  using  nonplanar  2-D  object-views. 
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III.  Methodology 


S.  J  Introduction 

Citing  the  work  of  several  research  efforts,  Chapter  II  covered  a  subset  of  the  most  recent 
studies  using  autoregressive  features  for  image  recognition.  In  particular,  the  Complex  Partial 
Correlation  (CPARCOR)  method  was  noted  for  exceptionally  high  accuracy  when  its  features  were 
applied  to  recognition  of  planar  2-D  objects  corrupted  by  both  noise  and  partial  occlusion. 

This  chapter  covers  the  development  (Section  3.2),  validation  (Section  3.3),  and  i^pplication 
(Section  3.4)  of  the  CPARCOR  algorithm  used  by  this  thesis.  Along  with  the  fundamental  principles 
of  the  algorithm,  the  methodology  used  to  verify  both  the  main  program  and  supporting  subroutines 
is  described.  The  chapter  concludes  with  a  description  of  several  recognition  tests  conducted  using 
the  CPARCOR  algorithm  and  various  classifiers. 

S.2  CPARCOR  Algorithm  Development 

This  section  highlights  development  of  the  CPARCOR  algorithm  as  used  by  this  thesis. 
Derivation  of  the  CPARCOR  algorithm  is  also  explained  in  the  article  by  Sekita  and  others[17]; 
however,  only  the  most  important  equations  required  for  implementation  are  presented  here. 

Although  a  variety  of  autoregressive  shape  descriptors  exi8t[2][3][9][17],  few  are  able  to  match 
the  Position,  Scale,  and  Rotation  Invariant  (PSRl)  properties  of  the  Complex  PARCOR  features. 
Basically  em  extension  of  the  reid  PARCOR  coefficients  (for  speech  signal  processing),  the  CPAR¬ 
COR  features  are  calculated  from  sampled  boundaries  of  complicated,  non-convex  2-D  objects. 
Recognition  can  then  be  accomplished  using  a  Euclidean  metric  to  measure  distance  between  the 
coefficients. 
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Generation  of  the  CPARCOR  coefficients  begins  by  representing  each  point  of  an  object’s 
boundary  as  a  complex  number  of  form  Zj  —  Xj  +  ij/, .  In  this  manner,  a  vector  sequence  of 
complex  numbers,  Zj ,  is  obtained.  From  this  sequence,  a  Complex  AutoRegressive  (CAR)  model 
of  order  m  is  defined  by  a  linear  combination  of  the  m  preceding  boundary  points  [17],  or  as  shown 
below: 

m 

Zj  =  Y^akZj-k  +  >62  (3.1) 

k=\ 

where  Z  is  the  set  of  integers,  are  the  CAR  coefficients,  and  tj  represents  an  error  term. 

Based  on  this  model,  both  CAR  and  CPARCOR  coefficients  for  a  model  of  order  m  are  generated 
by  recursively  applying  the  following  three  equations: 


a(l)  =  n/ro 


a(m) 


a(m  -  1)  -  a(m  -  l)Ipm 
Pm 


(rm  -  r(m  -  l)'a(m  -  1)) 
P”*  "  (ro-r(m-l)^a(m-l)) 

The  individual  terms  for  Equations  3.2  to  3.4  are  defined  as  follows: 


(3.2) 


(3.3) 


(3.4) 


•  a(m)  =  [oi,  02, 03, ...,  Om]^  is  a  vector  of  CAR  coefficients. 

•  Pm  represents  the  mth  order  CPARCOR  coefficient. 

•  ric  =  complex  autocorrelation  of  Zj. 

0  r(m)  =  [ri,  r2,  rs, ...,  is  a  vector  of  complex  autocorrelation  coefficients. 

•  a  represents  the  complex  conjugate  of  a,  while  a^  is  the  vector  transpose  of  a. 

•  a(m  —  1)1  represents  the  vector  [om-t,Om-2f  — with  reverse  order  elements. 

•  r(m  -  !)•  is  the  transpose  of  r(m  -  1)^ 
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Figure  3.1  Initial  test  images;  (a)  Geometric  shapes  (b)  Typical  machine  parts. 

Based  on  results  of  algorithm-validation  tests  (Section  3.3),  it  should  be  noted  that  Equa¬ 
tion  3.3  was  published  incorrectly  in  the  original  article  by  Sekita  and  others  [17].  The  correct 
version  is  as  shown.  Although  the  CPARCOR  algorithm  generates  both,  the  CAR  coefficients  tend 
to  display  considerably  lower  recognition  rates  when  compared  to  the  CPARCOR  coefficients  (for 
identical  tests)  [17].  Thus,  only  the  CPARCOR  coefficients  will  be  considered  further. 

S.S  CPARCOR  Algorithm  Software  Validation 

The  following  subsections  provide  an  extensive  examination  of  the  CPARCOR  algorithm  used 
by  this  thesis.  Along  with  a  detailed  explanation  of  each  subroutine  in  Sections  3.3.1  to  3.3.4,  the 
methods  used  to  validate  the  algorithm  are  also  described  in  Sections  3.3.5  to  3.3.6. 

In  addition  to  the  primary  CPARCOR  routine  defined  by  Equations  3.2  to  3.4,  the  following 
preliminary  tasks/subroutines  were  required  in  order  to  generate  software  compatible  test  images: 

1.  Generation  of  Test  Images:  Creates  initial  and  primary  test  images. 

2.  Load  and  Enhance  Subroutine:  Creates  a  smooth,  unbroken  image  boundary. 

3.  Trace  Subroutine:  Traces  image  and  crdculates  the  boundary’s  centroid. 


4.  Sampling  Subroutine:  Provides  a  sampled  version  of  the  image’s  traced  boundary. 


(a)  M60  Tank  (b)  MSS  Truck  (c)  BTR  APC 

(d)  T62  Tank 

Figure  3.2  Primary  teat  images. 

S.3.1  Generation  of  Test  Images.  Test  image  generation  involved  a  two  step  process. 
First,  eight  different  classes  of  preliminary  test  data  were  generated  in  order  to  validate  both  the 
CPARCOR  algorithm  and  previous  research  results  [17].  Examples  of  the  initial  test  images  are 
shown  in  Figure  3.1.  Images  in  Set(b)  are  representative  of  the  actual  data  set  used  by  others 
[3] [17].  Scaled  (50%)  and  rotated  (30°)  versions  of  the  eight  cleisses  shown  were  also  created  to 
verify  the  PSRI  properties  of  the  CPARCOR  coefficients.  Initial  testing  was  conducted  using  four 
images  from  each  class  (32  total  images). 

Second,  in  order  to  attempt  three  dimensional  (3-D)  image  recognition,  2-D  representations 
of  3-D  images  had  to  be  created.  This  was  accomplished  with  the  aid  of  several  software  routines 
created  at  AFIT  [6]  and  the  Army’s  constructive  solid  geometry  based  computer  aided  design 
package,  BRL-CAD[1].  Images  (128  x  128  arrays)  were  generated  for  five  different  classes  which 
included  the  Army’s  M60  Tank,  M35  Truck,  BTR60  Armored  Personnel  Carrier(APC),  T62  Tank, 
and  M2  Infantry  Fighting  Vehicle(IFV).  A  representative  view  from  each  of  the  five  classes  is  shown 
in  Figure  3.2. 


(e)  M2  IFV 
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Figure  3.3  Typical  MATLAB  image  (M60  Tank). 

Image  views  were  generated  every  five  degrees  in  both  azimuth  (from  0“  to  355°)  and  elevation 
(from  0°  to  90°),  while  various  subsets  of  the  total  image  set  were  used  for  actual  test  applications. 
For  future  discussion,  all  views  will  be  referenced  by  the  following  format: 

CLASS-Azimuth-Elevation 

For  example,  the  front  view  of  an  M60  Tank  would  be  labeled  as  M60_0.0.  Similarly,  the  right  side 
of  an  M35  TYuck  would  be  labeled  as  M35j90j0. 

S.3.2  Load  and  Enhance  Image  Subroutine.  A  typical  image,  as  stored  in  a  MATLAB 
array,  is  shown  in  Figure  3.3.  For  compatibility  with  the  Trace  Subroutine  (Section  3.3.3),  the 
image  pixels  were  thresholded  for  two  grey  scale  levels  to  represent  either  a  background  pixel  (zero) 
or  a  target  pixel  (one). 

As  previously  mentioned,  the  image’s  boundary  must  be  traced  and  sampled  before  the  CPAR- 
COR  algorithm  can  be  applied.  Obviously,  it  would  be  trivial  for  a  human  to  trace  the  perimeter  of 
the  image  shown  in  Figure  3.3.  However,  notice  that  certain  gaps  exist  along  the  image’s  boundary 
due  to  the  use  of  a  128  X  128  format.  For  the  TVoce  Subroutine  to  accurately  follow  the  image’s 
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Figure  3.4  Effect  of  Mask  Subroutine  on  typical  image. 

boundary  without  getting  stuck,  these  gaps  need  to  be  filled  in.  Thus,  to  create  the  smooth, 
unbroken  boundary  required,  an  edge  detection  mask  is  first  applied  to  enhance  the  image  prior 
to  boundary  tracing.  Enhancing  both  horizontal  emd  vertical  lines  [13],  the  effects  of  the  edge 
detection  mask  are  shown  in  Figure  3.4. 

S.3.3  Trace  Subroutine.  Calculation  of  CPARCOR  coefficients  is  based  on  a  sampled 
version  of  the  original  image’s  boundary.  For  the  CPARCOR  algorithm,  sampling  is  performed 
on  on  the  object’s  boundary  using  a  simple  boundary  detect/trace  routine.  Based  loosely  on  the 
Turtle  algorithm  [3][4],  the  Trace  Subroutine  first  detects  and  then  follows  the  image’s  boundary  in 
a  clockwise  direction.  The  basic  algorithm  is  outlined  as  follows: 

1.  Scan  the  image  until  a  target  pixel  (one)  is  found. 

•  If  the  turtle  is  in  a  target  pixel,  turn  left  and  move  one  pixel  forward. 

•  If  the  turtle  is  in  a  background  pixel,  turn  right  and  move  one  pixel  forward. 

2.  Continue  tracing  the  boundary  until  the  turtle  has  found  the  starting  point 
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Figure  3.5  Effect  of  TVace  Subroutine  on  typical  image. 

As  discussed  in  the  previous  section,  two  conditions  must  be  satisfied  in  order  for  the  TVace 
Subroutine  to  function  properly.  First,  the  'mage  (to  be  traced)  must  be  clearly  defined  in  terms 
of  grey  scale  values.  This  is  accomplished  by  simply  thresholding  the  image  for  two  grey  scale 
levels,  either  background  pixel  (zero)  or  target  pixel  (one).  Second,  the  boundary  must  not  have 
any  spurious  gaps  that  will  cause  the  turtle  to  get  stuck  inside  the  image.  Again,  to  create  the 
unbroken  boundary  required,  ein  edge  detection  mask  is  applied  prior  to  boundary  tracing. 

The  Trace  Subroutine  returns  the  locations  of  all  boundary  pixels.  A  thinning  routine  is  then 
applied  to  eliminate  any  repeated  points  which  sometimes  occur  when  the  turtle  goes  around  a 
corner.  The  image’s  centroid  is  finally  calculated  from  the  thinned  sequence  of  boundary  pixels.  To 
insure  coefficient  invariance  to  boundary  translations,  the  centroid  is  used  to  represent  the  origin 
of  the  model’s  complex-coordinate  system.  Basically,  by  making  the  coefficients  invariant  to  both 
translation  and  rotation  about  the  origin,  they  are  automatically  invariant  to  any  rotation  [2][3][17]. 
Figure  3.5  shows  the  resulting  image  after  application  of  the  Trace  Subroutine. 
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Figure  3.6  Effect  of  Sampling  Subroutine  on  typical  image. 

3.5.4  Sampling  Subroutine.  After  the  centroid  has  been  calculated,  a  slightly  modified 
version  of  the  Trace  Subroutine  is  applied.  This  time,  the  image’s  coordinate  system  is  referenced 
from  the  centroid  (for  invariance).  The  modified  Trace  Subroutine  returns  a  sequence  contauning 
the  boundary’s  coordinates  (complex  numbers),  which  are  then  used  by  the  Sampling  Subroutine. 
Although  the  sampling  rate  can  be  user  specified,  it  is  recommended  that  the  number  of  samples 
be  fixed  to  a  constant  (for  each  class  of  images)  in  order  to  provide  better  PSRI  properties  [17]. 
In  fact,  no  significant  change  in  the  recognition  rate  according  to  the  number  of  representative 
boundary  points  was  noted  for  previously  reported  results  [17].  For  this  thesis,  sample  rates  were 
chosen  to  provide  an  accurate,  though  not  exact,  representation  of  the  2tctu«d  image.  For  example, 
30  seunples  were  used  for  tests  on  objects  shown  in  Figure  3.1,  while  160  samples  were  used  for  all 
objects  shown  in  Figure  3.2.  An  example  of  an  MOO’s  right-side  view  (M60^0-0)  recreated  from 
160  samples  is  shown  in  Figure  3.6. 

5.3.5  Coefficient  Stability  Teat.  As  outlined  at  the  start  of  this  chapter,  the  CPARCOR 
coefficients  of  higher  order  models  can  be  obtained  by  recursive  application  of  Equations  3.2  to 
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3.4.  To  verify  stability  of  the  coefficients  produced  by  these  equations,  a  complex  white  noise  test 
was  conducted  on  the  algorithm’s  transfer  function.  The  next  two  sections  will  discuss  the  transfer 
function’s  derivation  and  the  procedures  followed  for  the  white  noise  test,  respectively. 

3.3.5. 1  Transfer  Function  Derivation.  CPARCOR  coefficients,  an  extension  of  the 
real  PARCOR  coefficients,  are  a  direct  result  of  the  Complex  AutoRegressive  (CAR)  algorithm. 
Based  on  the  conventional  real  autoregressive  model,  a  CAR  model  of  order  m  is  defined  by 

m 

2;  =  j€Z  (3.5) 

k  =  l 

where  Z  is  the  set  of  integers,  att^i  CAR  coefficients,  and  ej  represents  an  error  term.  In 

general,  Zj  is  defined  as  a  linear  combination  of  the  preceding  m  boundary  points. 

Assuming  inputs  to  the  sequence  are  unknown,  an  estimate  of  zj  can  be  defined  by 

m 

(3.6) 

t=i 

The  error  between  the  estimate  and  the  actual  is  given  as 

e(n)  =  i(n)  -  z{n)  =  ej  (3.7) 

Rearranging  terms  and  taking  the  Z-transform[12][15]  of  Equation  3.5  yields: 


y(z)[i-X;at^’*]  =  X{^) 

k=l 

The  resulting  transfer  function  is  defined  as 


H(z)  -  Hi) _ J 

^  ^  Xiz)  ~ 


(3.8) 


(3.9) 
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S.S.5.2  White  Noise  Test  Procedures.  To  verify  stability  of  the  CPARCOR  coeffi¬ 
cients,  a  complex  white  noise  test  was  conducted  on  the  algorithm’s  transfer  function  (Equation  3.9) 


in  the  following  manner; 


1.  Generate  a  random  sequence  of  complex  white  gaussian  noise  (10,000  samples). 

2.  Generate  stable  (within  the  unit  circle)  test  coefficients  for  a  model  of  order  six  (arbitrary). 

3.  Filter  the  complex  white  noise  sequence  using  the  filter  described  by  the  test  coefficients  and 
autoregressive  transfer  function. 

4.  Apply  the  filter’s  output  vector  (impulse  response)  to  the  CPARCOR  algorithm. 

5.  Create  a  pole-zero  plot  of  the  calculated  CPARCOR  coefficients.  A  stable  sequence  will  result 
in  points  that  lie  within  the  unit  circle. 


Table  3.1  White  noise  test  coefficient  comparison. 


Order  \  Vector 

Original 

CPARCOR  Algorithm 

1 

2 

3 

-1.2578 

-1.2522  +  0.0048i 

4 

0.0226 

0.0231  -  0.0018i 

5 

0.0939 

0.0922  -  0.0025i 

6 

0.7165 

0.7138  -1-  0.0006i 

Table  3.1  provides  a  direct  comparison  of  the  original  test  coefficients  to  those  generated  by 
the  CPARCOR  algorithm.  The  coefficients  are  shown  to  be  a  reasonable  representation  of  the 
original  set.  As  shown  in  Figure  3.7,  all  poles  (coefficients)  also  lie  within  the  unit  circle.  Thus, 
the  algorithm  does  in  fact  produce  stable  CPARCOR  coefficients.  MATLAB  software  routines  and 
relevant  data  sets  used  for  this  test  are  provided  in  Appendix  B,  while  the  source  code  for  the 
CPARCOR  Algorithm  is  provided  in  Appendix  C. 
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Figure  3.7  Result  of  complex  white  noise  test  for  CPARCOR  algorithm. 

S.S.6  Algorithm  Verification  Teat.  This  section  details  the  initial  test  used  to  verify  oper¬ 
ation  of  the  entire  CPARCOR  algorithm.  Also  included  is  a  look  at  the  nature  of  the  CPARCOR 
coefficients  when  subject  to  changes  in  scale  and  rotation. 

As  highlighted  in  Chapter  II,  the  CPARCOR  algorithm  presents  an  exceptional  invariant- 
feature  extraction  model  for  recognition  of  arbitrary  planar  objects.  Previous  experimental  results 
indicated  that  complicated  and  putially  occluded  shapes  could  be  recognized  with  high  accuracy, 
even  with  low-order  models  [17].  Based  on  these  results,  several  simple  test  images  were  generated 
in  order  to  verify  the  proper  operation  of  the  CPARCOR  algorithm,  the  stated  PSRI  properties, 
and  the  outcomes  of  previous  research  efforts  [17].  Examples  of  the  initial  eight  class  test  set  were 
shown  in  Figure  3.1  on  Page  3-3.  Four  images  (including  scaled  and  rotated  versions)  were  created 
per  class,  for  a  total  of  32  images.  A  tenth  order  CPARCOR  model  was  created  for  each  image  (30 
boimdary-samples  each). 

Classification  of  the  initial  test  set  was  done  using  the  hold~one-out  method  and  a  l-Nearest- 
Neighbor  (INN)  classifier[10].  A  twenty  dimensional  feature  vector  was  created  (from  a  tenth  order 
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model)  for  each  class  by  separating  the  real  and  imaginary  parts  of  the  complex  coefficients  [17]. 
Consistent  with  previous  research,  one  hundred  percent  classification  results  were  achieved. 

The  next  logical  step  was  to  more  fully  explore  the  nature  of  the  CPARCOR  coefficients  when 
subject  to  changes  in  scale  and  rotation.  To  this  end,  a  plot  of  the  scaled  and  rotated  CPARCOR 
feature  vectors  was  generated  for  the  first  image  in  each  set  of  Figure  3.1.  Figure  3.8  (a)  is  a 
plot  of  the  original,  the  scaled(50%),  and  the  unscaled/rotated(30‘’)  feature  vectors  for  each  image. 
Although  some  intra-class  variation  exists,  the  two  classes  are  clearly  separable.  To  further  illustrate 
class  separability,  the  mean  of  the  three  feature  vectors  for  each  class  was  plotted  in  Figure  3.8  (b). 
This  figure  indicates  that  separability  continued  to  exists,  even  at  higher  order  models,  when  the 
image  was  subject  to  scale  and  rotational  changes. 

S.S.7  Section  Summary.  Section  3.3  provided  a  detailed  explanation  of  the  CPARCOR 
algorithm  as  well  as  the  methodology  used  for  its  validation.  The  algorithm  was  shown  capable 
of  producing  stable  CPARCOR  coefficients,  and  the  previously  reported  PSRI  properties  and  high 
recognition  rates  of  CPARCOR  features  [17]  were  verified.  Next,  Section  3.4  will  describe  the 
three  different  classifiers  used  to  examine  recognition  performance  of  the  CPARCOR  features  on 
nonplanar  2-D  images  (of  3-D  objects).  A  discussion  of  the  modifications  m2ule  to  the  methodology 
traditionally  applied  (when  classifying  by  autoregressive  models)  is  also  included 

3.4  Classifier  Application 

Although  the  Auto  Regressive  (AR)  model  was  previously  noted  to  perform  exceptionally 
well  when  applied  to  various  planar  2-D  object  recognition  tests,  the  goal  of  this  thesis  was  to 
demonstrate  application  of  the  CPARCOR  algorithm  to  spatio-temporal  recognition  of  3-D  objects. 
As  such,  the  CPARCOR  method  of  feature  extraction  was  applied  to  nonplanar  2-D  views  of  3- 
D  objects.  Recognition  based  on  the  CPARCOR  features  was  then  evaluated  using  the  several 
different  classification  techniques. 
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(a) 

lytatn  Valw  of  Soito  Mid  FMMlon  VMion 


(b) 

Figure  3.8  CPARCOR  coefficient  PSRI  comparisons,  (a)  Feature  vectors  for  scaled  and  rotated 
square  and  machine  part  (b)  Mean  value  of  square  and  machine  feature  vectors. 
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Providing  an  explanation  of  each  claasifier  applied  for  recognition  testing,  this  section  is 
broken  down  as  follows: 

•  Section  3.4.1  contains  a  discussion  of  several  modifications  made  to  the  methodology  tradition¬ 
ally  applied  when  classifying  by  autoregressive  models.  Also  highlighted  is  a  brief  sununary 
of  the  classifiers  applied  for  recognition  testing. 

•  Section  3.4.2  provides  an  extensive  explanation  of  the  Template  Matching  algorithm  that  was 
developed  specifically  for  this  thesis.  Included  in  Section  3.4.2. 1  are  the  baseline  hypotheses 
used  to  evaluate  the  template  algorithm’s  experimental  results.  Additionally,  Sections  3.4.2.2 
and  3.4.2.3  provide  details  of  specific  tests  conducted. 

s  Section  3.4.3  provides  a  discussion  of  the  Single-Look  and  Multiple-Look  1-NN  classifiers  used 
to  test  their  respective  recognition  techniques  on  unnormalized  CPARCOR  feature  vectors. 

s  Section  3.4.4  details  the  procedures  of  the  Hidden  Markov  Model  recognition  test.  For  clarity, 
examples  based  on  Fourier  features  are  included  in  the  discussion,  while  results  for  CPARCOR 
features  are  not  presented  until  Chapter  IV. 

A  discussion  of  the  results  for  all  tests  is  included  Chapter  IV.  For  clarity,  additional  results 
for  some  of  the  Template  Matching  algorithm  tests  are  included  in  Appendix  A. 

3.4.1  Modifications.  To  more  fully  examine  the  robust  nature  of  CPARCOR  coefficients, 
several  modifications  were  made  to  the  methodology  traditionally  applied  when  classifying  by  au¬ 
toregressive  models  [2][3][9][17]. 

First,  the  complexity  and  number  of  stored  images  (per  class)  was  significantly  increased 
to  determine  the  impact  of  crowding  the  feature  space.  The  images  of  Figure  3.1  on  Page  3-4 
are  representative  of  those  used  for  previous  research  efforts  [2][3][9][17].  Clearly,  they  pale  in 
comparison  to  the  complexity  of  the  test  images  shown  in  Figure  3.2  on  Page  3-4.  Also,  previous 
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efforts  typically  used  well  under  100  images  per  class.  This  thesis,  on  the  other  hand,  used  anywhere 
from  four  to  several  hundred  images  per  class  (depending  on  the  application). 

Second,  the  relationship  between  the  number  of  charnctenstic  views  and  the  corresponding 
recognition  rate  was  explored  through  application  of  a  Template  Classifier.  A  charaicteristic  view  is 
an  object  orientation,  p2uticular  to  a  given  class,  in  which  surrounding  views  yield  a  similar  set  of 
observed  features.  For  this  thesis,  the  minimal  number  of  characteristic  views  were  defined  as  the 
froni,  back,  top,  left,  and  right  side  views  of  each  class.  Basically,  various  dimensional  templates 
were  used  to  classify  views  not  stored  in  the  templates.  This  effort  was  essentially  an  extension  of 
previous  work  by  Seibert  and  Waxman  [16]. 

Third,  the  impact  of  partial  occlusion  on  the  corresponding  recognition  rate  was  examined  by 
using  known  levels  of  occlusion.  For  example,  image  occlusions  were  separated  into  three  distinct 
categories:  5%,  10%,  and  20%  levels  of  partial  occlusion.  Although  previous  researchers  included 
levels  of  up  to  60%  partial  occlusion  [9],  no  distinction  was  ever  made  (in  terms  of  recognition) 
between  the  effect  a  5%  versus  a  60%  level  of  occlusion. 

Finally,  all  classihcation  attempts  were  based  on  nonplanar  views.  Previous  research  efforts 
simply  rotated  the  same  image  in-plane  for  classification  testing  [2] [3] [9] [17].  For  this  research, 
however,  views  for  each  of  the  five  classes  (reference  Figure  3.2)  were  generated  in  five  degree 
increments  from  0°  to  355°  in  azimuth  and  0°  to  90°  in  elevation.  Images  were  generated  in  this 
manner  to  determine  if  a  spatio-temporal  relationship  existed  among  the  char!u:teristic  views  for 
each  class. 

To  evaluate  the  CPARCOR  coefficients  ability  to  provide  robust  PSRI  features  for  recognition 
of  nonplanar  2-D  images  (of  3-D  objects),  the  following  methods  of  classification  were  applied: 

•  Template  Matching  Algorithm:  Using  a  Euclidean  distemce  metric,  the  impact  of  unstored 

and  partially  occluded  views  on  single-frame  recognition  performance  was  examined.  Recall 
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that  unstored  views  were  defined  as  characteristic  views  that  are  not  included  (or  stored)  in 
the  respective  template. 

•  K- Nearest- Neighbor  (KNN)  Techniques:  Recognition  of  both  single  and  multiple  frames  of 
imagery  wjks  performed  using  the  hold-one-out  method  and  Single/Multiple-Look  1-NN  clas¬ 
sifiers.  A  direct  comparison  with  recognition  by  Fourier  features  was  made. 

•  Hidden  Markov  Model  (EMM):  Temporal  sequence  recognition  by  CPARCOR  features  was 
attempted  for  comparison  with  recognition  by  Fourier  features. 

S.jI.2  Template  Matching  Algorithm.  The  relationship  between  the  number  of  character¬ 
istic  views  and  the  corresponding  recognition  rate  was  explored  through  application  of  a  template 
classifier.  Providing  a  list  of  baseline  hypotheses  for  performance  evaluation,  this  section  will  ex¬ 
amine  the  basic  function  of  the  classifier  algorithm.  Also  included  is  an  explanation  of  the  tests 
conducted  on  data  sets  of  unstored  and  occluded  characteristic  views. 

Template  matching  is  a  simple,  but  effective  method  of  pattern  recognition.  Basically,  an 
input  of  unknown-class  is  compared  to  a  set  of  known-class  prototypes[18].  Recognition  (class 
identification)  is  determined  by  establishing  the  template’s  class  containing  the  closest  match  to 
the  unknown  input.  Of  the  various  means  available  to  establish  the  matching  criteria,  this  thesis 
will  use  a  Euclidean  distance  metric  defined  as  follows: 

D{x)  =  inin||x  -  Zi  ||  (3.10) 

where  ||  ■  ||  defines  the  Euclidean  norm  for  the  expression,  and  D{z)  is  the  smallest  of  all  distances 
between  the  unknown  input-view  (x)  and  the  ftb  stored  template-view  (z,')[18].  In  other  words, 
classification  is  based  upon  the  minimum  Euclidean  distance  between  a  known  template-view  and 
the  unknown  input-view.  Using  this  metric,  sever^  template  sets  were  developed  in  order  to  exam¬ 
ine  the  relationships  between  unstored/occluded  views  and  the  number  of  stored  (in  the  template) 
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characteristic  views.  Templates  were  based  on  m  x  n  matrices  containing  unnormalized  CPARCOR 
feature  vectors  that  represent  the  characteristic  views  for  various  allowable  object  orientations.  The 
following  templates  were  used  for  testing: 

•  360°  View  Template  (3x3  Matrix);  Contains  the  minimum  number  of  characteristic  views 
(CPARCOR  feature  vectors)  required  for  total-range  360°  azimuth  and  90°  elevation  image 
recognition. 

•  180°  View  Template  (2  x  3,5  x  9, 10  x  19  Matrix):  Representing  the  same  180°  view, 
the  three  matrices  contain  different  numbers  of  characteristic  views.  For  example,  the  2x3 
matrix  contains  six  characteristic  views,  while  the  10  x  19  contains  190. 

•  90°  View  Template  (2  x  2, 5  x  5, 10  x  10, 19  x  19  Matrix);  Although  representative  of  the 
same  90°  view,  the  2x2  matrix  uses  only  four  characteristic  views,  while  the  19  x  19  uses 
361.  Note  that  the  19  x  19  contains  the  maximum  number  of  characteristic  views  possible  for 
the  given  range.  Thus,  the  19  x  19  matrix  was  not  used  for  unstored  view  recognition  tests. 

An  illustrative  example  of  the  360°  View  Template  (3x3  Matrix)  is  shown  in  Figure  3.9 
on  Page  3-18.  Note  that  the  CPARCOR  feature  vectors,  not  the  images,  are  actually  stored  in 
the  template.  Although  other  possibilities  exist,  the  transitional  views  (in  each  corner)  for  this 
template  were  represented  by  the  following: 

•  Upper  left  corner:  Class^l5-45 

•  Upper  right  corner;  Class.45.45 

•  Lower  right  corner:  Class-135.45 

•  Lower  left  corner:  Class_225.45 

The  Template  Matching  algorithm  returns  the  matrix  location  and  corresponding  class  label 
of  the  best  match  for  a  given  unknown-input  view.  The  source  code  for  this  algorithm  and  the 
various  templates  is  included  in  Appendix  D. 


Figure  3.9  Characteristic  Views  in  360°  Template  (a)  Basic  template  format  (b)  Representative 
test  images  for  M60  Tank. 
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3.4- 2. 1  Baseline  Hypotheses.  The  Template  A/afcAtn^  algorithm  was  used  to  perform 
a  variety  of  experiments  on  test  sets  of  both  unstored  and  partially  occluded  characteristic  views 
of  the  five  classes  shown  in  Figure  3.2  (Page  3-4).  Experimental  results  were  evaluated  against  the 
following  baseline  hypotheses; 

•  Recognition  performance  should  increase  with  model  order  until  a  point  of  optimal  perfor¬ 
mance  is  achieved.  After  this  point,  no  addition  benefit  will  be  realized  by  using  higher  order 
models. 

•  For  a  given  model  order,  recognition  performance  should  decrease  as  the  number  of  available 
classes  is  increased. 

•  Object  symmetry  should  allow  for  a  reduction  in  overall  template  and  matrix  size.  For 
example,  the  360°  View  Template  should  provide  almost  identical  recognition  rates  as  the 
180°  View  Template. 

•  As  the  matrix  size  of  a  given  template  is  increased  (to  include  more  characteristic  views), 
recognition  performance  for  unstored  views  should  also  increase.  The  exception  to  this  rule 
would  be  recognition  of  partially  occluded  views  (see  next  item). 

•  Recognition  of  partially  occluded  views  should  decrease  ais  the  matrix  size  of  a  given  template 
is  increased.  Generally,  as  more  characteristic  views  are  added,  the  occluded  view  has  a  higher 
probability  of  being  misclassified  due  to  the  existence  of  many  similar  choices. 

•  Recognition  performance  should  decrease  as  the  percentage  of  partial  occlusion  is  increased 
from  5%  to  20%. 

3. 4- 2. 2  Matrix  Reductions  and  Vnsiored  View  Testing.  This  section  presents  the 
motivation  behind  an  attempt  to  reduce  the  overall  template  size  required  for  reliable  recognition. 
Initially  presented  is  an  examination  of  symmetrical  characteristic  views.  This  is  followed  by  an 
explanation  of  how  unstored  views  were  used  to  test  the  template/matrix  reduction  hypothesis. 
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Figure  3.10  Characteristic  Views  for  MOO  Tank. 

Based  on  the  assumptions  of  Section  3.4.2. 1,  preliminary  template  testing  was  focused  on 
ways  to  reduce  the  overaJl  template  size.  A  straight  forward  approach  to  this  problem  would  seem 
to  involve  using  an  object’s  symmetry  to  reduce  the  number  of  stond  characteristic  views.  Thus, 
a  plot  of  the  MOO  Tank’s  characteristic  views,  as  stored  in  the  300°  view  template,  was  made  in 
order  to  examine  the  CPARCOR  coefficients.  As  shown  in  Figure  3.10,  similar  feature  vectors  we 
indeed  generated  for  views  separated  by  180°  in  azimuth. 


Table  3.2  Unstored  view  test  set  (Class-AzimutbJllevation). 


Class-0-5 

Class.5.5 

Class-5.35 

Class_15-15 

Class.!  5^5 

Class-25J 

Class  J25-45 

ClassJO^O 

Class-35^5 

Clas8.45.15 

Class-45.45 

Class.45_75 

Class-55-25 

Class.55-55 

Class.00.85 

Class.05^5 

Class-05-45 

Clas8.05^5 

Clas8.75.15 

Class.75J5 

Clas8_85J 

Class.85w35 

Cla8s^5.45 

Class.90.15 

Class.90.j85 

Taking  advantage  of  this  information,  an  attempt  to  reduce  the  overall  template  size  was 
conducted  by  directly  comparing  the  recognition  accuracy  of  templates  for  300°,  180°,  and  90° 
views.  Sets  of  various  sized  matrices  were  used  to  test  the  recognition  rate  of  each  template  on 
a  given  test  set  of  unstored  views.  A  baseline  for  comparison  was  established  by  restricting  the 
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unstored  views  to  the  set  shown  in  Table  3.2.  Representative  results  of  the  matrix  reduction  and  the 
unstored  view  tests  are  fully  discussed  in  Chapter  IV.  For  clarity,  additional  results  were  included 
in  Appendix  A. 

3. 4. 2.3  Occlusion  Testing.  One  of  the  more  difficult  problems  associated  with  image 
recognition  is  the  classification  of  partially  occluded  object  view8[3].  This  section  will  explain  the 
logic  behind  partial  occlusion  tests  conducted  with  the  template  algorithm.  Initially  presented 
eire  results  of  a  test  performed  to  verify  previously  reported  experimental  results  [3][17].  Next,  an 
examination  of  actual  CPARCOR  feature  vectors  subject  to  various  levels  (0,  5,  10,  and  20%)  of 
occlusion  is  shown.  Finally,  the  methodology  used  specifically  for  this  thesis  is  explained. 


Table  3.3  Views  occluded  5,  10,  and  20%. 


M60-0-0 


M60-0-0(new) 


M6OJ0-9O 


M60_0-90(new) 


M60-90.0 


Before  examining  the  effects  of  known  levels  of  partial  occlusion,  test  results  reported  by 
others  [3][17]  were  verified  using  the  five  M60  Tank  views  shown  in  Table  3.3.  Note  that  only  the 
M60  class  was  used  for  this  test.  M60  views  were  occluded  5,  10,  and  20%  for  a  total  test  set 
of  15  occluded  views.  Two  occluded  versions  of  the  MfiO’s  front  and  right  side  views  (indicated 
by  ‘new’)  were  included  in  order  to  test  the  effect  moving  an  occlusion  to  a  new  location  on  the 
same  view.  Using  a  template  set  of  2  x  2  matrices  (one  per  class),  the  test  set  of  occluded  M60 
views  (Table  3.3)  was  applied  to  both  a  three  (M60,  BTR,  M35)  and  a  five  class  problem.  Shown 
in  Figure  3.11  (a),  results  of  the  three  class  comparison  re2Mlily  verify  the  high  recognition  rates 
reported  by  previous  efforts.  Note  that  the  recognition  rate  of  zero  for  order  one  was  valid  due  to 
the  use  of  only  one  class  for  testing.  The  significant  drop  in  recognition  for  the  five  class  case  was 
thought  to  be  due  to  confusion  caused  by  the  addition  of  two  more  tracked  vehicles  (M2,  T62).  To 
verify  this.  Figure  3.11  (b)  compares  the  results  of  the  five  class  problem  to  results  of  a  more  similar 
three  class  case  (M60,  M2,  T62).  Given  that  the  same  line  is  defined  for  both  cases,  the  drop  in 
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recognition  for  Figure  3.11  (a)  was  clearly  caused  by  confusion  related  to  the  two  additional  classes 
of  similar  tracked  vehicles.  Again,  the  zero  recognition  rate  (order  one)  was  valid  due  to  the  use  of 
a  one-class  test  set.  Previous  efforts  [3][17]  reported  higher  classification  rates  by  performing  tests 
on  several  less  similar  classes. 

A  plot  of  two  feature  vectors  for  occluded  M60  characteristic  views  was  made  in  order  to 
examine  CPARCOR  coefficients  subject  to  various  levels  (0,  5,  10,  and  20%)  of  occlusion.  As 
shown  in  Figure  3.12  (a)  on  Page  3-24,  similar  feature  vectors  are  indeed  generated  for  occlusions 
of  the  same  view.  To  further  illustrate  this  point,  a  plot  of  the  overall  mean  for  each  set  of  vectors 
was  made.  As  shown  in  Figure  3.12  (b),  the  two  characteristic  views  are  clearly  separable. 


Table  3.4  M60  and  M2  views  occluded  5,  10,  and  20%. 


M60-0_0 

M60_15,15 

M60-30-30 

M60.0-45 

M60.45_0 

M60.45.45 

M60.65.65 

M60-75-25 

M60_0J0 

M60J0-0 

M2.0-0 

M2-15-15 

M2-30J0 

M2.0.45 

M2.45J0 

M2-45.45 

M2-65J55 

M2-75^5 

M2.0J0 

M2-90JD 

Based  on  this  information,  the  impact  of  partial  occlusion  on  the  corresponding  recognition 
rate  wm  examined  by  using  known  levels  of  occlusion  and  various  template  sizes.  Separated  into 
three  distinct  categories  (5%,  10%,  and  20%  ),  60  occluded  images  (total)  were  made  for  the  M60 
Tank  and  the  M2  IFV  from  the  set  listed  in  Table  3.4.  The  two  classes  of  tracked  vehicles  were 
chosen  in  order  to  test  the  robustness  of  recognition  on  occluded,  but  similar  objects.  Also,  to 
make  recognition  more  difficult,  all  three-class  testing  was  performed  using  the  M60,  M2,  and  T62 
(tracked  vehicles).  Occlusions  were  ‘hand-made’  by  removing  a  number  of  image  pixels  (from  the 
128  X  128  array)  corresponding  to  the  desired  level  of  occlusion.  A  subset  of  occluded  M60  views 
is  shown  in  Figure  3.13  on  Page  3-26.  Chapter  IV  contains  representative  results  and  a  discussion 
of  these  tests.  For  clarity,  additional  results  were  included  in  Appendix  A. 

The  following  sections  present  the  methodology  applied  to  two  additional  classifiers  used  by 
this  thesis.  Section  3.4.3  provides  a  discussion  of  the  Single-Look  and  Multiple-Look  1-NN  classifiers 
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(b) 

Figure  3.11  Verification  of  previously  reported  occluded  view  recognition  rates.  Test  set  consisted 
of  occluded  M60  Tank  views  only.  Recognition  rates  for  (a)  Dissimilar  3  class  (M60, 
BTR,  M35)  (b)  Similar  3  class  (M60,  M2,  T62). 
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Figure  3.12  CPARCOR  coefficient  occlusion  comparisons,  (a)  Feature  vectors  for  M60  character¬ 
istic  views  occluded  0,  5,  10,  and  20%  (b)  Mean  value  of  occluded  feature  vectors. 
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used  to  test  their  respective  recognition  techniques  on  unnormalized  CPARCOR  feature  vectors. 
Similarly,  Section  3.4.4  details  the  procedures  of  the  Hidden  Markov  Model  recognition  test. 

3.4  s  K-Nearest-Neighbor  (KNN)  Tecknignes.  The  nearest-neighbor  rule  is  a  sub-optimal 
technique  that  will  typically  lead  to  an  error  greater  than  the  Bayes  rate,  the  minimum  possible 
[4].  For  this  thesis,  two  versions  of  a  1-NN  classifier  were  used  for  recognition  of  both  single-frame 
and  multiple-frame  sequences  of  uncorrupted  (no  noise/occlusion)  imagery.  Tests  were  conducted 
to  compare  the  effects  of  Single-Look  versus  Multiple-Look  recognition  on  unnormalized  CPARCOR 
feature  vectors.  A  five  class  problem  was  attempted,  using  images  from  the  set  of  Army  ground 
vehicles  shown  in  Figure  3.2  on  Page  3-4.  For  each  class,  50  randomly  generated  image  sequences 
of  lengths  14,  16,  18,  and  20  frames  each  (200  total  sequences)  were  created.  For  a  view  area  of 
0°  to  180°  azimuth  and  0°  to  75°  elevation,  sequential  frames  were  allowed  to  differ  (or  remain  the 
same)  by  5°  in  azimuth  or  elevation.  In  this  manner,  3400  individual  image  frames  were  created  for 
each  class  (17,000  total  frames).  A  tenth  order  CPARCOR  model,  chosen  for  compatibility  with 
Fourier  results,  was  then  created  for  each  image. 

For  the  Single-Look  1-NN  test,  the  hold-one-out  method  was  used  to  sequentially  hold  out 
a  single  frame  of  imagery  for  comparison  with  the  remaining  16,999.  The  hold-one-out  method 
provides  an  upper  bound  to  the  Bayes  error  rate,  yielding  a  worst-case  estimate  of  the  error  when 
generalizing  on  unseen  data[8].  Classification  was  then  based  on  the  class  containing  the  single  best 
matched  frame  to  the  one  withheld.  Although  following  the  same  initial  procedures,  classification 
for  the  Multiple-Look  1-NN  was  based  on  the  class  containing  the  majority  of  best  matched  frames 
per  sequence,  not  per  frame.  The  results  of  these  tests,  along  with  a  direct  comparison  with 
recognition  by  Fourier  features,  are  discussed  in  Chapter  IV. 

3.4-4  Hidden  Markov  Model  (HMM)  Testing.  To  determine  if  CPARCOR  features  could 
accurately  represent  temporally  encoded  sequences  of  images,  a  Hidden  Markov  Model  (HMM) 
classifier  was  applied.  The  HMM  classifier  had  previously  displayed  exceptional  recognition  rates 
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Figure  3.14  Example  of  Mean  Distortion  Curves  (MDC)  for  cluster  centers  (Fourier  features 
shown). 

(over  97%)  for  temporal  sequences  represented  by  a  low  frequency,  Fourier  magnitude  feature  set 
[7].  Thus,  a  direct  5  class  performance  comparison  was  attempted  by  simply  replacing  the  Fourier 
coefficients  with  a  set  of  CPARCOR  coefficients.  The  data  set  consisted  of  the  live  Army  ground 
vehicles  previously  shown  in  Figure  3.2. 

Based  on  a  viewer  centered  approach,  the  EMM  used  the  Linde,  Buzo,  and  Grey  (LBG)[11] 
clustering  algorithm  to  create  a  vector  quantizer  whose  clusters  corresponded  to  areas  (on  a  viewing 
sphere)  of  similar  characteristic  view[7].  For  test  purposes,  the  region  of  interest  was  restricted  to 
a  0°  to  180°  azimuth,  0°  to  75°  elevation  portion  of  the  viewing  sphere.  592  images  per  class 
were  generated  for  this  view  region.  Both  the  CPARCOR  and  Fourier  coefficients  were  statistically 
normalized  across  all  feature  vectors  to  produce  zero  mean,  unit  variance  features. 

A  codeword  vector  quantizer  was  produced  by  processing  the  feature  vectors  with  the  LBG 
algorithm.  The  result  was  a  cluster  distortion  versus  codebook  size  plot  as  shown  (for  Fotirier  feature 
set)  in  Figure  3.14.  Based  on  this  data,  a  64  codeword  vector  quantizer  was  chosen  for  use  with 
the  Fourier  data.  The  same  size  quantizer  was  applied  to  the  CPARCOR  data  for  consistency  in 
technique  comparisons. 
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Figure  3.15  Example  of  sphere  mapping  (Fourier  features  shown). 

Sphere  plots  were  used  to  associate  the  mapping  of  each  viewing  position  to  a  particular 
codebook  entry.  Basically,  each  viewing  position  was  associated  with  a  characteristic  view.  An 
example  of  the  viewing  spheres  for  the  Fourier  data  (CPARCOR  results  are  shown  in  Chapter  IV) 
is  shown  in  Figure  3.15.  Areas  of  common  shading  represent  a  shared  characteristic  view  or  aspect. 
Recognition  ambiguity  is  seen  to  exist  when  more  than  one  object  class  occupies  the  same  cluster. 

Respective  class  HMMs  were  trained  on  the  same  200,  randomly  generated  sequences  used 
for  the  Single  and  Multiple-Look  I\- Nearest- Neighbor  (KNN)  tests  (Section  3.4.3).  Although  not 
expected  to  have  any  great  impact  on  the  HMM  test,  it  should  be  noted  that  128  x  128  pixel  arrays 
were  used  for  the  CPARCOR  images,  while  256  x  256  arrays  were  used  for  the  Fourier  images.  A 
sphere  plot  example  of  five  of  the  randomly  generated  training  sequences  is  shown  in  Figure  3.16 
on  Page  3-29.  Results  from  this  direct  comparison  of  CPARCOR  and  Fourier  features  are  fully 
discussed  in  Chapter  IV. 
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Figure  3.16  Example  of  sphere  trajectories. 


3.5  Chapter  Summary 

The  development  (Section  3.2),  validation  (Section  3.3),  and  application  (Section  3.4)  of  the 
CPARCOR  algorithm  has  been  presented.  Along  with  fundamental  principles  of  the  algorithm,  the 
methodology  used  to  verify  both  the  main  program  zuid  supporting  subroutines  was  described.  The 
chapter  concluded  with  a  description  of  several  recognition  tests  conducted  using  the  CPARCOR 
algorithm  and  various  classifiers.  Chapter  IV  discusses  the  results  of  these  tests. 


IV.  Results  and  Discussion 


Chapter  HI  covered  the  development,  validation,  and  application  of  the  Complex  Partial 
Correlation  (CPARCOR)  algorithm.  Along  with  fundamental  principles  of  the  algorithm,  the 
methodology  used  to  verify  both  the  main  program  and  supporting  subroutines  was  described.  The 
chapter  concluded  with  a  description  of  several  recognition  tests  conducted  using  the  CPARCOR 
algorithm  and  various  classifiers. 

This  chapter  provides  a  discussion  of  the  results  for  tests  described  in  Chapter  III.  Presented 
in  the  same  manner  as  in  Chapter  III,  the  following  topics  are  considered: 

•  Template  Teat  ReauHa.  This  section  provides  a  discussion  of  the  results  for  tests  conducted 
with  the  Template  Matching  algorithm.  Although  numerous  tests  were  conducted,  only  rep¬ 
resentative  results  are  shown  (for  clarity)  in  this  chapter.  Appendix  A  contains  all  remaining 
test  results. 

•  KNN  Teat  Reanlta.  Single  and  Multiple-Look  1-NN  classifier  results  are  discussed  in  this 
section.  A  direct  comparison  to  recognition  by  Fourier  features  is  ^d80  made. 

•  EMM  Teat  Reanlta.  Following  a  brief  review  of  the  methodology  attempted  for  HMM  testing, 
a  look  at  why  the  CPARCOR  model  failed  to  train  is  presented. 

4.1  Template  Teat  Results 

After  a  brief  review  of  the  baseline  hypotheses  used  for  performance  evaluation,  this  section 
will  discuss  the  results  of  tests  conducted  with  the  Template  Matching  algorithm.  For  the  sake  of 
clarity,  this  chapter  will  only  show  representative  results  for  the  matrix-reduction/unstored-view 
tests  (Section  4.1.1),  as  well  as  the  occluded  view  tests  (Section  4.1.2).  Appendix  A  contains  all 
remaining  results  for  these  tests. 
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Using  a  Euclidean  distance  metric,  the  relatiooship  between  the  number  of  characteristic  views 
and  the  corresponding  recognition  rate  was  explored  through  application  a  template  classifier. 
Template  sets  were  based  on  m  x  n  matrices  containing  unnormalixed  CPARCOR  coefficient  vec¬ 
tors.  The  following  baseline  hypotheses,  originally  presented  in  Chapter  III,  were  used  to  evaluate 
template  recognition  performance; 

•  Recognition  performance  should  tncnaae  with  model  order  until  a  point  of  optimal  perfor- 
mamce  is  achieved.  After  this  point,  no  addition  benefit  will  be  realized  by  using  higher  order 
models. 

•  For  a  given  model  order,  recognition  performance  should  deertase  as  the  number  of  available 
classes  is  increased. 

m  Object  symmetry  should  allow  for  a  reduction  in  overall  template  and  matrix  size. 

•  As  the  matrix  size  of  a  given  template  is  increased  (to  include  more  characteristic  views), 
recognition  performance  for  unstored  views  should  also  tncrease.  The  exception  to  this  rule 
would  be  recognition  of  partisdly  occluded  views  (see  next  item). 

•  Recognition  of  partially  occluded  views  should  decrease  as  the  matrix  size  of  a  given  template 
is  increased. 

•  Recognition  performance  should  decrease  as  the  percentage  of  partial  occlusion  is  increased 
from  5%  to  20%. 

4.1.1  Matrix  Reduction  and  Unstored  View  Testing.  This  section  examines  the  results 
of  the  matrix  reduction  and  unstored  view  tests.  Motivated  by  the  results  of  Figure  3.10  (Sec¬ 
tion  3.4.2. 2),  various  tests  were  conducted  to  demonstrate  that  object  symmetry  should  allow  for  a 
reduction  in  overall  template  and  matrix  size.  For  example,  the  360°  View  Template  was  expected 
to  provide  almost  identical  recognition  rates  as  the  90°  View  Template  for  views  in  the  90°  az- 
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imuth/elevation  region.  Applying  the  unstored  view  test  set  (Table  3.2)  to  the  360°,  180°,  and  90° 
View  Templates,  this  hypothesis  was  verified  by  directly  comparing  recognition  rates. 

To  illustrate  the  feasibility  of  symmetry-related  template  reductions,  unstored  view  recogni¬ 
tion  rates  for  the  360°  and  the  90°  view  templates  were  directly  compared  for  a  three  class  problem 
as  shown  in  Figure  4.1  on  Page  4-4.  Except  for  some  obvious  variation  in  (c),  overall  recognition 
rates  were  practically  identical.  Extending  this  idea  to  a  five  class  case,  Figure  4.2  on  Page  4-5 
indicates  that  similar  results  were  again  obtained.  Notice,  though,  that  rates  are  significantly  lower 
for  the  five  class  case  due  to  the  addition  of  two  classes  of  tracked  vehicles.  Although  not  shown 
here  (see  Appendix  A),  recognition  rates  were  typically  higher  when  larger  matrix  sizes  were  used. 
Similar  results  were  also  found  for  comparisons  between  the  360°  and  180°  templates  as  well  as 
between  the  180°  and  90°  templates.  Plots  for  these  results  can  be  found  in  Appendix  A.  Thus, 
assuming  object  symmetries  do  exists,  this  method  of  comparison  clearly  validated  template  re¬ 
duction  as  a  viable  option.  Obvious  advantages  of  template  reduction  include  a  decrease  in  the 
number  of  characteristic  views  and  the  processing  time  required  for  recognition. 

A  comparison  of  90°  template  recognition  rates  (using  various  matrix  sizes)  for  three  and  five 
classes  is  shown  in  Figures  4.3  and  4.4  (Pages  4-7  and  4-8).  Although  recognition  performance 
was  generally  found  (for  all  tests)  to  decrease  as  the  number  of  available  classes  was  increased, 
recognition  performance  was  not  seen  to  increase  with  model  order  in  a  consistent  manner.  Instead, 
rates  tended  to  fluctuate  randomly  with  model  order.  Also,  unstored  view  recognition  rates  did 
not  necessarily  increase  with  an  increase  in  matrix  size  for  a  given  template.  In  Figure  4.4  (a), 
for  example,  consistent  recognition  (relative  to  matrix  size)  at  lower  orders  is  seen  to  suddenly 
contradict  itself  after  model  order  12.  At  this  point,  the  2x2  matrix  displays  higher  recognition 
rates  than  the  5  x  5  or  the  10  x  10.  These  two  discrepancies  were  evident  in  other  tests  as  well. 
Thus,  the  two  relevant  hypotheses  were  considered  false. 
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Figure  4.1  Comparison  of  90°  and  360°  template  for  unstored  view  recognition  results  with  3 
classes. 
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(e)  M2  IFV 


Figure  4.2  Comparison  of  90°  and  360°  template  for  unstored  view  recognition  results  with  5 
classes. 
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Recall  that  the  data  set  applied  to  these  tests  consisted  of  unstored  views,  which  were  defined 
as  characteristic  views  not  stored  in  the  respective  template.  This  means  that,  depending  on  the 
template  used,  test  views  could  have  easily  varied  by  as  much  as  30°  (or  more/less)  in  both  azimuth 
and  elevation  relative  to  the  stored  views  in  the  template.  Significant  in  a  statistical  sense,  assuming 
equally  likely  occurrences,  was  that  the  overall  recognition  trends  using  these  views  were  typically 
much  higher  than  that  of  pure  chance.  For  example,  using  the  10  x  10  matrix,  peak  recognition 
values  of  70%  and  80%  are  seen  to  occur  (5  class  case),  while  no  rate  lower  than  24%  is  noted. 
As  expected,  rates  were  shown  even  higher  for  the  three  class  cases  where  less  similarity  between 
claisses  occurred.  Along  these  lines,  higher  recognition  rates  would  have  also  been  expected  had  the 
original  test  set  been  modified  to  include  several  sfored-views.  In  general,  though,  the  recognition 
rates  from  these  tests  verified  that  reasonably  accurate  cletssification  based  on  a  limited  number  of 
stored  characteristic  views  was  possible. 

4-1.2  Occlusion  Testing.  Following  a  brief  review  of  the  test  procedures  and  baseline 
hypotheses,  this  section  presents  recognition  results  from  tests  conducted  using  partially  occluded 
object-views.  For  clarity,  only  representative  test  results  for  the  M60  Tank  shall  be  presented 
here.  Additional  results,  including  tests  conducted  on  occluded  M2  IFV  views,  are  included  in 
Appendix  A. 

Motivated  by  the  successful  results  of  Figure  3.11  (Section  3.4.2.3),  various  tests  were  con¬ 
ducted  using  known  levels  of  occlusion  to  examine  the  impact  of  partial  occlusion  on  the  corre¬ 
sponding  recognition  rate.  Separated  into  three  distinct  categories  (5%,  10%,  and  20%  ),  occluded 
images  were  made  for  both  the  MOO  Tank  and  the  M2  IFV.  The  two  classes  of  tracked  vehicles  were 
chosen  in  order  to  test  the  robustness  of  recognition  on  occluded,  but  similar  objects.  A  subset 
of  ‘hand-made’  occluded  MOO  views  was  previously  shown  in  Figure  3.13  (Page  3-20).  For  more 
realistic  recognition  testing,  all  three  class  problems  consisted  of  the  following  tracked  vehicles:  MOO 
Tank,  the  M2  IFV,  and  the  T02  Tank.  Based  on  preliminary  test  results  using  this  three  class  set 
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(c)  M35  IVuck 


Figure  4.3  Unstored  view  recognition  results  for  3  classes  and  various  90°  view  templates. 
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(a)  M60  Tank  (b)  M35  Truck 
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Figure  4.4  Unstored  view  recognition  results  for  5  classes  and  various  90°  view  templates. 
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(see  Appendix  A),  little  variation  was  expected  to  exist  between  three  and  five  class  comparisons. 
For  test  purposes,  60  occluded  images  (total)  were  made  for  the  M60  Tank  zmd  M2  IFV  from  the 
data  set  previously  shown  in  Table  3.4  (Page  3-22).  Recognition  rates  for  the  occluded  views  were 
evaluated  against  the  following  baseline  hypotheses: 

•  Recognition  of  partially  occluded  views  should  decrease  as  the  matrix  size  of  a  given  template 
is  increased. 

•  Recognition  performance  should  decrease  as  the  percentage  of  partial  occlusion  is  increased 
from  5%  to  20%. 

Results  for  both  three  and  five  class  tests  using  the  occluded  M60  view  data  set  are  shown  in 
Figure  4.5  (Page  4-10)  and  Figure  4.6  (Page  4-11),  respectively.  The  similarity  between  recognition 
rates  for  the  two  figures  was  due  to  use  of  the  stated  tracked  vehicles  (three  class  problem).  Although 
similar  results  are  also  shown  in  Appendix  A  (Figure  A. 10),  neither  hypotheses  was  shown  to  be 
valid.  For  example,  instead  of  displaying  a  linear  relationship,  recognition  rates  were  rather  constant 
no  matter  what  level  of  occlusion  was  applied.  Also,  even  though  recognition  of  putially  occluded 
views  should  have  decreased  as  the  matrix  size  was  increased,  peak  recognition  levels  were  noted 
to  occur  (in  all  cases)  for  the  19  x  19  matrix.  Thus,  both  of  the  original  hypotheses  were  shown 
to  be  false.  In  spite  of  this,  however,  the  consistently  high  recognition  rates  noted  for  all  tests 
demonstrated  that  CPARCOR  features  were  capable  of  reasonably  accurate  classification. 

Although  not  shown  until  Section  4.3  (Figure  4.9),  such  unexpected  results  were  found  to 
be  caused  by  an  unusual  mapping  of  the  coefficients  in  the  feature  space.  An  overcrowding  had 
occurred  in  which  feature  vectors  were  basically  right  on  top  of  one  another.  Keep  in  mind  that 
classification  was  being  performed  using  a  Euclidean  distance  metric.  As  such,  recognition  was 
based  on  the  closest  overall  match  within  the  template,  not  necessarily  the  closest  matching  view 
of  similar  azimuth  and  elevation.  Thus,  it  was  quite  possible  that  an  M60  view  occluded  20%  could 
be  a  better  match  for  the  M60  class  than  the  same  view  occluded  only  5%. 
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(c)  (d) 


Figure  4.5  Three  class  recognition  rates  for  occluded  M60  views.  Various  matrix  sizes  used  are 
(a)  2  X  2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix. 
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(a) 


(b) 


(c) 


(d) 


Figure  4.6  Five  class  recognition  rates  for  occluded  M60  views.  Various  matrix  sizes  used  are  (a) 
2x2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix. 
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4.1.3  Discussion  of  Template  Test  Results.  This  section  provides  a  summary  of  the  results 
for  tests  conducted  with  the  Template  Matching  algorithm.  Following  a  discussion  of  the  template 
reduction  and  unstored  view  tests,  results  from  the  occlusion  tests  are  considered. 

Results  from  the  template  reduction  and  unstored  view  tests  clearly  verified  that  object 
symmetry  allowed  for  a  reduction  in  overall  template  and  matrix  size.  This  is  significant  in  that 
a  decreased  in  the  number  of  characteristic  views  and  the  processing  time  required  for  recognition 
can  be  realized.  Also  validated  was  the  hypothesis  that,  for  a  given  model  order,  recognition 
performance  should  decrease  as  the  number  of  available  classes  was  increased.  This  generally 
accepted  result  is  based  on  the  idea  that  classifiers  have  a  greater  chance  of  confusing  objects  as 
the  number  of  classes  is  increased.  Unexpected  were  the  recognition  rates  for  increased  model  order 
and  matrix  size.  As  a  result,  the  CPARCOR  coefficients  were  shown  to  contradict  two  of  the  initial 
hypotheses.  Although  relatively  high  recognition  rates  were  noted  (given  unstored  view  test  set)  an 
optimal  CPARCOR  model  order  and  corresponding  template  size  could  not  be  determined  strictly 
based  on  these  test  results.  In  a  statistical  sense,  though,  overall  recognition  trends  were  shown  to 
be  much  higher  than  that  of  pure  chance.  Peak  recognition  values  of  70%  and  80%  were  noted  even 
for  the  five  class  case.  Considering  that  the  test  set  consisted  of  unstored  views,  the  recognition 
rates  clearly  verified  that  reasonably  accurate  classification  based  on  a  limited  number  of  stored 
characteristic  views  was  possible. 

Occluded  view  testing  revealed  some  rather  unexpected  results.  In  fact,  both  of  the  original 
hypotheses  used  to  evaluate  the  tests  were  shown  to  be  false.  For  example,  recognition  rates  were 
rather  constant  no  matter  what  level  of  occlusion  was  applied,  and  recognition  of  partially  occluded 
views  did  not  decrease  in  a  consistent  manner  as  the  matrix  size  was  increased.  In  addition  to  this, 
an  unusual  feature-space  mapping  was  noted  in  which  feature  vectors  were  basically  right  on  top  of 
one  another.  In  spite  of  the  seemingly  negative  results  for  this  series  of  tests,  overall  recognition  rates 
were  relatively  high.  Although  ein  optimal  CPARCOR  model  order  and  corresponding  template 
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size  could  not  be  determined,  recognition  rates  well  over  50%  were  displayed  for  all  tests  considered. 
Noting  that  the  test  sets  consisted  of  three  rather  similar  tracked  vehicles,  the  CPARCOR  features 
were  considered  to  exhibit  reasonably  accurate  classification  capabilities.  Fin2dly,  it  is  interesting 
to  note  that  previous  research  always  combined  all  levels  of  occlusion  together  for  test  purposes. 
This  test  was  one  of  the  first  to  display  the  effect  of  individual  known  levels  of  occlusion. 

The  following  sections  discuss  results  from  the  two  additional  classifier  tests.  Section  4.2 
presents  the  results  of  the  K- Nearest- Neighbor  tests,  while  Section  4.3  provides  the  results  for 
Hidden  Markov  Model  testing. 

4-2  K- Nearest- Neighbor  (KNN)  Test  Results 

Following  a  brief  review  of  the  methodology  used  for  K- Nearest- Neighbor  testing,  results  for 
both  the  Single- Look  and  Multiple- Look  1-NN  classifiers  are  discussed.  A  direct  comparison  to 
recognition  by  Fourier  features  is  also  made. 

Two  versions  of  a  1-NN  technique  were  used  for  recognition  of  both  single  and  sequential 
frames  of  uncorrupted  (no  noise/occlusion)  imagery.  Tests  were  conducted  to  compare  the  effects 
of  Single-Look  versus  Multiple-Look  recognition  on  unnormalized  CPARCOR  features.  A  five  class 
problem  was  attempted,  using  images  from  the  set  of  Army  ground  vehicles  previously  shown  in 
Figure  3.2  (Page  3-4).  For  each  class,  50  randomly  generated  image  sequences  of  lengths  14,  16, 
18,  and  20  frames  each  (200  total  sequences)  were  created.  For  the  view  area  of  0°  to  180°  azimuth 
and  0°  to  75°  elevation,  sequential  frames  were  allowed  to  differ  (or  remain  the  same)  by  only  5° 
in  azimuth  or  elevation.  In  this  m^ulner,  a  total  of  3400  individual  image  frames  were  created  for 
each  class  (17, f  ial  frames).  For  compatibility,  an  unnormalized,  tenth  order  CP/  RC  JR  model 
was  transformed  {real  and  imaginary  parts  separated)  into  a  twenty  dimensional  feature  vector  to 
represent  each  image. 
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Table  4.1  Single- Look  test  results  for  CPARCOR  features. 


Input  \  Classify 

M60 

M35 

BTR 

T62 

M2 

Percent  Correct 

M60  Teink 

2149 

165 

44 

921 

121 

63 

M35  Truck 

54 

2332 

626 

65 

323 

69 

BTR  APC 

25 

669 

2440 

33 

233 

72 

T62  Tank 

790 

107 

58 

2305 

140 

68 

M2  Tank 

60 

467 

260 

84 

2529 

74 

Table  4.2  Single-Look  test  results  for  Fourier  features. 


Input  \  Classify 

M60 

M35 

BTR 

T62 

M2 

Percent  Correct 

M60  Tank 

3398 

0 

0 

1 

1 

99.9 

M35  Truck 

0 

3391 

9 

0 

0 

99.7 

BTR  APC 

0 

9 

3391 

0 

0 

99.7 

T62  Tank 

2 

0 

1 

3396 

1 

99.9 

M2  Tank 

0 

0 

0 

5 

3395 

99.8 

4. 2.1  Single- Look  1-NN  Testing.  Single-Look  1-NN  testing  used  the  hold-one-out  method 
to  sequentially  hold  out  a  single  frame  of  imagery  for  comparison  with  the  remaining  16,999. 
Classification  was  then  based  on  the  class  containing  the  single  best  matched  frame  to  the  one 
withheld.  Results  of  this  test  are  shown  in  Table  4.1.  Although  achieving  m  overall  accuracy 
of  69%,  recognition  was  generally  considered  poor  since  no  image  corruption  was  performed.  For 
comparison,  results  based  on  the  use  of  Fourier  features  are  provided  in  Table  4.2.  Results  are 
notably  higher,  with  an  average  recognition  rate  of  99.8%. 


Table  4.3  Multiple-Look  test  results  for  CPARCOR  features. 


Input  \  Classi  fy 

M60 

M35 

BTR 

T62 

M2 

Percent  Correct 

M60  Tank 

181 

0 

0 

12 

B 

90 

M35  TVuck 

0 

198 

0 

0 

2 

99 

BTR  APC 

0 

9 

191 

0 

0 

95 

T62  Tank 

7 

0 

0 

188 

5 

94 

M2  Tank 

0 

0 

0 

0 

200 

100 
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Table  4.4  Multiple-Look  test  results  for  Fourier  features. 


Input  \  Classify 

M60 

M35 

BTR 

T62 

M2 

Percent  Correct 

M60  Tank 

200 

0 

0 

0 

0 

100 

M35  Truck 

0 

200 

0 

0 

0 

100 

BTR  APC 

0 

0 

200 

0 

0 

100 

T62  Tank 

0 

0 

0 

200 

0 

100 

M2  Tank 

0 

0 

0 

0 

200 

100 

4.2.2  MuUiple-Look  J~NN  Testing.  Following  the  same  initial  procedures,  classification 
for  the  Multiple-Look  1-NN  was  based  on  the  class  containing  the  majonfy  of  best  matched  frames 
per  sequence,  not  per  frame.  Results  of  this  test  using  CPARCOR  coefficients,  shown  in  Table  4.3, 
are  significantly  higher  than  for  the  respective  1-NN  test.  Although  not  as  clearly  demonstrated 
by  the  Fourier  features  (Table  4.4),  the  dramatic  improvement  is  easily  noted  for  the  CPARCOR 
features.  An  overall  accuracy  of  96%  was  achieved,  indicating  a  decisive  advantage  of  sequential 
classification  over  the  previous  single-frame  techniques  of  Template  Matching  and  Single-Look  1- 
NN.  Statistically  speaking,  however,  the  results  are  not  all  that  surprising.  Assuming  independence 
of  class  views,  recognition  rates  are  generally  expected  to  increase  as  more  frames  are  considered 
for  classification.  Also  note  that  for  this  particular  classifier,  sequence  order  was  irrelevant.  In 
other  words,  classification  results  would  have  been  the  same  no  matter  how  the  individual  frrunes 
were  arranged  in  the  sequences. 

4.2.3  Discussion  of  KNN  Test  Results.  The  advantage  of  classification  by  a  multiple- 
look  technique  over  the  traditional  single-look  method  was  clearly  demonstrated  by  the  KNN  tests. 
Based  on  a  direct  comparison  between  CPARCOR  and  Fourier  features,  the  multiple-look  technique 
was  noted  to  provide  higher  classification  rates  for  each  case.  Although  recognition  rates  by  Fourier 
features  were  notably  higher  than  those  of  the  CPARCOR  features,  the  actual  application  should 
be  considered  before  deciding  on  which  method  to  use.  For  example,  the  PSRI  properties  of  the 
CPARCOR  features  are  inherent,  while  additional  signal  processing  is  required  in  order  to  achieve 
similar  results  with  Fourier  features.  Also,  the  CPARCOR  features  may  be  more  readily  capable 
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of  dealing  with  parti2d  occlusions  than  Fourier  features.  For  either  case,  though,  it  has  been  shown 
that  classification  should  be  performed  based  on  multiple-look  techniques. 

The  next  section  provides  a  discussion  of  the  Hidden  Markov  Model  test.  A  direct  comparison 
of  recognition  by  CPARCOR  and  Fourier  features  was  attempted. 

4.3  Hidden  Markov  Model  (HMM)  Test  Results 

Although  successful  results  were  obtained  for  the  previous  classifier  tests,  the  CPARCOR  fea¬ 
tures  failed  to  provide  geographically  oriented  sphere  clustering  properties.  As  such,  a  direct  apples 
to  apples  comparison  of  recognition  by  CPARCOR  and  Fourier  features  could  not  be  performed. 
Thus,  following  a  brief  review  of  the  methodology  attempted  for  HMM  testing,  a  look  at  why  the 
CPARCOR  model  failed  to  train  is  presented. 

A  Hidden  Markov  Model  (HMM)  classifier  was  applied  to  determine  if  CPARCOR  features 
could  accurately  represent  temporally  encoded  sequences  of  images.  The  HMM  classifier  had  pre¬ 
viously  displayed  exceptional  recognition  rates  (over  97%)  for  temporally  encoded  sequences  repre¬ 
sented  by  a  low  frequency,  Fourier  magnitude  feature  set  [7].  Thus,  a  direct  five  class  performance 
comparison  was  attempted  by  simply  replacing  the  Fourier  coefficients  with  a  set  of  CPARCOR 
features.  The  data  set  consisted  of  the  five  Army  ground  vehicles  previously  shown  in  Figure  3.2 
on  Page  3-4. 

Based  on  a  viewer  centered  approach,  the  HMM  used  the  Linde,  Buzo,  and  Grey  (LBG)[11] 
clustering  algorithm  to  create  a  vector  quantizer  whose  clusters  corresponded  to  areas  (on  a  viewing 
sphere)  of  similar  characteristic  view[7].  For  test  purposes,  the  region  of  interest  was  restricted  to 
a  0°  to  180°  azimuth,  0°  to  75°  elevation  portion  of  the  viewing  sphere.  A  total  of  592  images 
per  class  were  generated  for  this  view  region.  Both  the  CPARCOR  and  Fourier  coefficients  were 
statistically  normalized  across  all  feature  vectors  to  produce  zero  mean,  unit  variance  features. 
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(a) 


Figure  4.7 


Number  of  ChMlM' C«nlM«  (eedvbeok  alM) 

(b) 

Comparison  of  Mean  Distortion  Curves  (MDC)  for  cluster  centers  (a)  CPARCOR 
features  (b)  Fourier  features. 


A  codeword  vector  quantizer  was  produced  by  processing  the  feature  vectors  with  the  LBG 
algorithm.  The  result  was  a  cluster  distortion  versus  codebook  size  plot  as  shown  in  Figure  4.7  for 
both  the  CPARCOR  and  the  Fourier  feature  sets.  Based  on  this  data  and  a  desired  apples  to  apples 
performance  comparison,  a  64  codeword  vector  quantizer  was  chosen. 

Sphere  plots  were  used  to  associate  the  mapping  of  each  viewing  position  to  a  particular 
codebook  entry.  Basically,  each  viewing  position  becomes  associated  with  a  characteristic  view. 
Viewing  spheres  for  both  the  CPARCOR  and  the  Fourier  data  are  shown  in  Figure  4.8  on  Page  4- 
19.  Areas  of  common  shading  represent  a  shared  characteristic  view  or  aspect.  Clearly  the  spheres 
in  (b)  display  more  highly  defined  clusters  than  those  in  (a).  Based  on  this  data  and  the  idea 
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that  recognition  ambiguity  occurs  when  more  than  one  object  class  occupies  the  same  cluster, 
classification  results  were  forecast  to  be  much  lower  for  the  CPARCOR  features. 

Training  of  respective  class  HMMs  was  attempted  on  the  same  200,  randomly  generated 
sequences  of  128  x  128  images  previously  used  for  the  Single  and  Multiple- Look  K-Nearest-Neighbor 
(KNN)  tests  (Section  4.2).  However,  due  to  the  inconsistent  sphere  clustering  (noted  earlier),  three 
of  the  five  classes  refused  to  train,  and  the  two  that  did  were  of  questionable  worth.  To  see  if 
the  problem  could  be  corrected,  training  attempts  were  made  with  the  use  of  both  more  and  less 
codebook  vectors,  as  well  as  unnormalized  data.  Although  the  HMMs  eventu2dly  trmned  on  a  set 
of  unnormaJized,  16  codebook  vector  data,  the  test  was  dismissed  eis  being  too  aunbiguous. 

In  an  attempt  to  determine  why  the  CPARCOR  features  failed  to  provide  geographically 
oriented  sphere  clustering  properties,  the  mean  and  standard  deviation  of  the  feature  vectors  for 
each  class  (unnormalized  data)  was  examined.  As  shown  in  Figure  4.9  (a)  oi  Page  4-20,  little 
spatial  separation  was  seen  to  exist  between  each  of  the  five  classes.  Also,  Figure  4.9  (b)  illustrates 
that  only  a  slight  amount  of  variance  existed  between  each  class  (note  axis  scale).  Extremely  close 
to  unity  for  all  five  classes,  first  order  coefficients  displayed  very  little  variation.  Recall  that  the 
feature  vector  of  complex  coefficients  was  sepairated  into  both  real  and  imaginary  parts  for  this  plot. 
Hence,  the  variance  was  shown  close  to  zero  at  indices  of  one  (real  part)  and  eleven  (imaginary 
part).  Due  to  this  crowding  of  the  CPARCOR  features,  clearly  defined  clustering  weis  not  possible. 
Thus,  for  the  model  order  considered,  classification  by  HMM  was  not  possible. 
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(b) 


Figure  4.8  Comparison  of  5  class  sphere  mapping  (a)  CPARCOR  features  (b)  Fourier  features. 
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Mumhidn 

(a) 


Figure  4.9  Comparison  of  5  class  (a)  Mean  and  (b)  Standard  Deviation  of  CPARCOR  feature 
vectors  (unnormalized  data). 


I 
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Chapter  Summary 


The  CPARCOR  features  were  evaluated  using  several  methods  of  classification.  Template 
Classifiers  were  used  to  examine  the  relative  recognition  rates  for  unstored  and  partially  occluded 
views.  Results  indicated  that  template  reductions  based  on  object  symmetry  were  possible.  Specif¬ 
ically,  the  360°  view  template  was  reduced  to  a  90°  view  template  without  any  significant  impact 
on  recognition  rate.  Feature  space  crowding  was  noted  as  the  cause  of  unusual  recognition  rates  for 
occluded  views.  Although  general  trends  were  noted,  an  optimed  matrix  size  and  coefficient  order 
could  not  be  determined. 

CPARCOR  features  were  also  classified  by  both  Single  and  Multiple-Look  KNN  classifiers. 
Using  a  1-NN  prototype  and  a  twenty  dimensional  model,  recognition  rates  of  69%  and  96%  were 
achieved  for  respective  classifiers.  Based  on  a  direct  comparison  to  Fourier  feature  results,  the 
advantage  of  classification  by  the  multiple-look  technique  over  the  traditional  single-look  method 
was  clearly  shown.  Use  of  the  either  the  CPARCOR  or  Fourier  features  was  noted  to  be  application 
dependent. 

The  inability  of  the  CPARCOR  features  to  exhibit  consistent  clustering  properties  resulted 
in  a  failed  attempt  to  classify  by  Hidden  Markov  Model.  A  plot  of  the  overedl  mean  and  standeud 
deviation  for  the  CPARCOR  features  reveled  that  little  spatied  separation  existed  between  the 
five  classes  considered.  Having  failed  at  attempts  to  establish  a  temporal  relationship  among  the 
CPARCOR  features,  the  Fourier  features  were  proven  more  optimal  for  HMM  classification. 
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V.  Conclusions  and  Recommendations 


Expanding  previous  test  methodologies  to  include  nonplanar  views,  this  thesis  effort  sought 
to  encode  and  evaluate  the  Complex  Partial  Correlation  (CPARCOR)  algorithm  on  three  types 
of  classifiers.  Initially  highlighting  conclusions  from  the  classifier  tests,  this  chapter  also  provides 
several  recommendations  for  future  research. 

5. 1  Conclusions 

The  CPARCOR  method  of  feature  extraction  was  applied  to  nonplanar  2-D  views  of  3- 
D  objects.  Recognition  performance  was  evaluated  based  on  the  following  object  classification 
techniques: 

•  Template  Matching  Algorithm:  Using  a  Euclidean  distance  metric,  the  impact  of  unstored 
and  partiedly  occluded  views  on  single-frame  recognition  performance  was  examined. 

•  K-Nearest- Neighbor  (KNN)  Technique:  Recognition  of  both  single  and  multiple  frames  of 
imagery  was  performed  using  the  hold-one-out  method  and  Single/Multiple-Look  1-NN  clas¬ 
sifiers.  A  direct  comparison  with  recognition  by  Fourier  features  was  made. 

•  Hidden  Markov  Model  (HMM):  Temporal  sequence  recognition  by  CPARCOR  features  was 
attempted  for  comparison  with  recognition  by  Fourier  features. 

Results  indicate  that  the  CPARCOR  model  parameters  provide  useful  shape-features  for 
recognition  of  out-of-plane  rotations.  Displaying  exceptional  PSRI  properties,  the  features  were 
shown  capable  of  classification  by  simple  nonadaptive  recognition  schemes.  Relatively  successful 
results  were  obtained  for  a  variety  of  tests.  The  advantage  of  classification  by  a  multiple-look 
technique  over  the  traditional  single-look  method  was  clearly  demonstrated.  Although  general 
trends  were  noted,  optimal  model  order  and  selection  of  CPARCOR  versus  Fourier  features  were 
considered  application  dependent. 
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5.1.1  Template  Matching  Algorithm  Results.  Results  from  the  template  reduction  and 
unstored  view  tests  clearly  verified  that  object  symmetry  allowed  for  a  reduction  in  overall  template 
and  matrix  size.  Obvious  benefits  of  this  result  are  a  decrease  in  required  storage  capacity  and  an 
increase  in  recognition  processing  time.  Also  validated  was  the  hypothesis  that,  for  a  given  model 
order,  recognition  performance  should  decrease  as  the  number  of  available  classes  was  increased. 
Although  previously  reported  high  test  results  for  p/anor  views  were  verified,  an  optimal  CPARCOR 
model  order  and  corresponding  template  size  could  not  be  determined  based  on  results  for  the 
nonplanar  views.  Unstored  and  occluded  view  test  results  revealed  a  crowding  oi  the  feature  space 
that  resulted  in  several  classification  inconsistencies.  In  general,  though,  recognition  rates  clearly 
verified  that  reasonably  accurate  classification  based  on  a  limited  number  of  stored  characteristic 
views  was  possible.  Results  also  indicated  that  reasonably  high  recognition  rates  could  be  achieved 
by  using  less  similar  cl2isses,  as  well  as  fewer  classes. 

5.1.2  K- Nearest- Neighbor  (KNN)  Technique  Results.  The  advantage  of  classification  by 
a  multiple-look  technique  over  the  traditional  single-look  method  was  clearly  demonstrated  by  the 
KNN  tests.  As  discussed  in  Section  4.2,  a  dramatic  increase  in  the  recognition  rate  (69%  single 
versus  96%  multiple-look)  was  noted  for  the  CPARCOR  coefficients.  Based  on  a  direct  comparison 
between  CPARCOR  and  Fourier  features,  the  multiple-look  technique  was  noted  to  provide  higher 
classification  rates  for  each  case.  Although  recognition  rates  by  Fourier  features  were  notably  higher 
than  those  of  the  CPARCOR  features,  the  actual  application  should  be  considered  before  deciding 
on  which  method  to  use.  For  example,  the  PSRI  properties  of  the  CPARCOR  features  are  inherent, 
while  additional  signal  processing  is  required  in  order  to  achieve  similar  results  with  Fourier  features. 
Also,  the  CPARCOR  features  may  be  more  readily  capable  of  dealing  with  peirtial  occlusions  than 
Fourier  features.  For  either  case,  though,  it  has  been  shown  that  classification  should  be  performed 
based  on  multiple-look  techniques. 
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5.1.3  Hidden  Markov  Model  (HMM)  Teat  Reaulia.  Although  the  CPARCOR  features 
display  significantly  better  PSRI  properties  than  Fourier  features,  the  CPARCOR  coefficients  failed 
to  train  on  the  HMM  classifier.  Although  only  a  tenth  order  CPARCOR  model  was  applied  (for 
Fourier  comparison),  similar  poor  results  were  expected  for  other  model  orders  as  well  due  to 
feature  space  crowding.  Based  on  failed  attempts  to  establish  a  temporal  relationship  among  the 
CPARCOR  features,  the  Fourier  features  were  proven  more  optimal  for  HMM  classification. 

5. 2  Recomme  ndationa 

The  rather  promising  results  of  multiple-look  1-NN  tests  indicate  that  a  simple,  paeudo- 
temporal,  template  matching  classifier  could  easily  be  developed.  Based  on  the  moderately  high 
recognition  rates  from  the  Template  Matching  tests,  an  experimentally  determined  optimal  model 
order  and  matrix  size  could  be  combined  with  a  multiple-look  classifier.  The  classifier  would  be 
paeudo-temporal  in  the  sense  that  aequence  order  would  not  be  important.  Obvious  advantages  to 
this  approach  are  high  recognition  rates  and  inherent  design  simplicity. 

5.3  Future  Research 

Although  extensive  autoregressive  model  research  could  be  conducted  in  a  variety  of  areas, 
of  particular  interest  are  the  following: 

•  Image  reconstruction  from  the  CPARCOR  coefficients. 

•  Impact  on  recognition  rates  from  the  application  of  a  boundary  smoothing  technique. 


•  Characteristic  View  selection  by  clustering  algorithm. 


Appendix  A.  Supplemental  Test  Results 


To  supplement  results  contained  in  Chapters  III  and  IV,  this  appendix  contains  additional 
results  obtained  from  the  Template  Matching  algorithm  tests.  Intended  only  as  additional  refer¬ 
ence  material,  detailed  explanations  are  not  provided.  The  additional  figures  are  presented  in  the 
following  order: 

1.  Individual  Order  Teat.  Conducted  to  determine  if  certain  CPARCOR  coefficients  provide 
more  recognition  information  than  others.  Using  only  aingle-ordera  of  CPARCOR  coefficients, 
instead  of  the  usual  combined  effect,  recognition  was  attempted  using  the  data  set  of  unstored 
views  from  Table  3.2.  Figure  A.l  (a)  (Page  A-2)  clearly  illustrates  that  no  single  order  provides 
consistently  higher  recognition  rates  than  any  other.  For  clarity,  the  mean  recognition  rate 
at  each  order  is  shown  in  (b).  Although  some  fluctuation  is  evident,  no  one  order  is  noted  to 
have  a  more  optimal  recognition  performance  than  the  others.  Thus,  the  combined  effect  of 
the  CPARCOR  coefficients  was  applied  to  all  higher-order  tests. 

2.  The  360*^  View  Template  consisted  of  a  3  x  3  matrix  (9  characteristic  views)  for  each  class. 
Results  of  both  a  three  and  a  five  class  test  are  shown  in  Figure  A.2  (a)  and  (b)  (Page  A-3), 
respectively. 

3.  A  comparison  of  recognition  results  for  both  3  and  5  Classes  using  the  360°(3  x  3)  template 
is  shown  in  Figure  A.3  (Page  A-4). 

4.  Results  for  a  3  class  comparison  of  the  180°  and  360°  view  templates  are  shown  in  Figure  A.4 
(Page  A-5).  In  general,  no  significant  diflerence  in  recognition  rates  were  noted. 

5.  Results  of  tests  performed  on  the  180°  view  template  to  compare  recognition  rates  for  vuious 
matrix  sizes  are  shown  in  Figure  A.5  (Page  A-6). 

6.  Direct  comparisons  of  the  90°  and  180°  templates  for  recognition  of  a  three  class  problem 
(M60,  BTR,  M35)  are  shown  in  Figures  A.6  to  A. 8  (Pages  A-7  to  A-9). 

7.  Figure  A.9  (Page  A-10)  shows  the  results  of  a  three  dissimilar  class  (M60,  BTR,  M35)  and 
five  class  occlusion  test  using  various  matrix  sizes. 

8.  Figure  A. 10  (Page  A-11)  shows  the  results  of  a  three  similar  class  (MOO,  T62,  M2)  and  five 
class  occlusion  test  using  various  matrix  sizes. 

9.  A  comparison  of  both  3  and  5  Class  recognition  of  occluded  MOO  characteristic  views  using 
90°  and  360°  templates  is  shown  in  Figure  A.ll  on  Page  A-12.  In  general,  no  significant 
difference  in  recognition  rates  were  noted. 

10.  Figure  A. 12  (Page  A-13)  and  Figure  A. 13  (Page  A-14)  show  results  for  occluded  M2  view 
tests. 

11.  A  comparison  of  the  mean  recognition  rates  (based  on  matrix  size)  for  both  the  MOO  and  the 
M2  is  shown  in  Figure  A. 14  (Page  A-15). 


A-1 


1( 
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Figure  A.l  Single-order  CPARCOR  coefficient  recognition  ^ulaly8i8:  (a)  Recognition  results  for 
all  five  classes  (b)  Mean  value  of  recognition  results  (all  five  classes). 
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Figure  A.2  Unstored  view  comparison  using  360°(3  x  3)  for  (a)  3  Classes  and  (b)  5  Classes. 
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Figure  A.3  Comparison  of  recognition  results  for  3  and  5  Class,  360°(3  x  3)  template 
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Figure  A.4  Comparison  of  180°(2  x  3)  and  360°(3  x  3)  template  for  unstored  view  recognition 
results  with  3  classes. 
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Figure  A.6  Comparison  using  90°  (2  x  2, 5  x  5, 10  x  10)  and  180°  (2  x  3, 5  x  9, 10  x  19)  templates 
for  unstored  view  recognition  of  the  MOO. 
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Figure  A. 7  Comparison  using  90°  (2  x  2, 5  x  5, 10  x  10)  and  180°  (2  x  3, 5  x  9, 10  x  19)  templates 
for  unstored  view  recognition  of  the  BTR. 
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Figure  A.8  Comparison  using  90°  (2  x  2, 5  x  5, 10  x  10)  and  180°  (2  x  3, 5  x  9, 10  x  19)  templates 
for  unstored  view  recognition  of  the  M35. 
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Figure  A. 9  Comparison  of  Occlusion  results  for  15  occluded  M60  characteristic  views  using 
similar  3  class  set  (M60,  BTR,  M35). 
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Figure  A.  10  Occlusion  results  for  15  occluded  M60  characteristic  views  using  similar  3  class 
(M60,  M2,  T62). 
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Figure  A.l  1  Comparison  of  90°  and  360°  templates  for  recognition  of  occluded  M60  chareicteristic 
views  (a)  3  Classes  (b)  5  Classes. 
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(c)  (d) 

Figure  A.  12  3  class  recognition  rates  for  10  occluded  M2  views.  Various  matrix  sizes  used  are  (a) 

2x2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix. 
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Figure  A.  13  3  class  recognition  rates  for  10  occluded  M2  views.  Various  matrix  sizes  used  are  (a) 

2x2  Matrix  (b)  5  x  5  Matrix  (c)  10  x  10  Matrix  (d)  19  x  19  Matrix. 
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Figure  A.  14  Mean  value  of  occlusion  recognition  rates  for  veirious  size  templates.  M60  and  M35 
results  shown  for  3  and  5  Classes. 


A-15 


Appendix  B.  White  Noise  Test  Source  Code 


B.  1  Introduction 


This  appendix  contains  M  ATLAB  software  routines  and  relevant  data  sets  used  for  the  CPAR- 
COR  coefficient  white  noise  test  (see  Chapter  III).  Material  is  presented  in  the  same  order  as  it 
was  used  for  testing. 


B.2  MAT  LAB  Test  Routines  and  Data  Sets 


This  routine  will  generate  a  sat  ot  stable  test  coefficients  for  use 
vith  the  shite  noise  sequence  routine.  For  stability,  all  points 
must  lie  sithin  the  unit  circle. 

xxxxy.x%xxxxy.xxxxxxxxx%xxxxxxxxxmxxxxxx%y.xxxxxxxxxy.xxxxxxxxxxxx%xxx%x 

X  To  insure  stability,  the  following  parameters  were  used  to 

X  define  the  actual  test  data: 

rsC0.9g  0.96  0.90]; 

freq=[266  2330  3000]; 

f 8=8000; 

fp=125; 

np=i00; 

X  Compute  polar  representation 
rads= (f req/ (f  s/2) } epi ; 

X  Compute  cartesian  coordinates 
x=r.*co8(rad8} ;  y=r.e8in(rad8} ; 
p0=x+y* j ; 
pl=x-y*j ; 

X  Compute  the  polynomial  representation 
pp=[p0  pi] ; 
ppoly=real (poly (pp) ) ; 
a=ppoly 

X  Application  of  the  above  code  results  in  the  following  test  vector: 

X 

X  a=Cl.0O00  -0.1807  -0.1023  -1.2678  0.0226  0.0939  0.7166]; 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

This  routine  will  generate  the  complex  white  noise  sequence  based  on 
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a  set  ol  paraaeters  delined  by  both  the  user  and  the  aodel's 
transfer  function. 

%mxmxxmmxxmxmxxxmmmxxxmxmmxmxxmxmxmxn 

X  The  following  paraneter  assignaents  were  used: 

IslOOOO;  X  Desired  shite  noise  sequence  length 
s^l;  X  Variance 

bsl;  X  Defined  based  on  the  aodel’s  transfer  function  (Y(z)  =  1). 


x-sqrt(s/2)e(randn(l,l)>jerandn(l,I)):  X  Generates  the  CVGI  sequence 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


This  routine  sill  filter  the  data  in  vector  ‘z’  with  the  filter 
described  by  vectors  'a’  and  'b'  to  produce  the  filtered  data  'y' 
In  this  case,  'y’  is  the  filter's  infinite  iapulse  response. 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


y=filter(b,a,x): 


z=y;  X  Arbitrary  label  used  to  define  the 

X  vector  input  for  the  CPARCOR  algorithm 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


Following  are  the  actual  paraaeters  applied  to  the  CPARCOR  auLgorithm 
for  generation  of  the  complex  coefficients,  lots  that  only  the 
results  of  the  CPARCOR  algorithm  are  included  here,  not  the  code. 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


k  =  0; 
sum  =  0; 

I  =  10000; 
order  =  6; 


X  Must  be  initialized  to  zero  for  proper  indexing 
X  Must  be  initi2^.ized  to  zero  for  proper  indexing 
X  Defined  based  on  the  number  of  sample  points 
X  User  specified  model  order,  from  1  to  H 


X  Application  of  the  CPARCOR  algorithm  results  in  the  following  coefficients: 

X  coeff  =  [-0. 1846+0. 0017i  -0.0993-0.00071  -1.2522+0.00481  0.0231-0.00181 
0.0922-0.00261  0.7138+0.00061] 


This  routine  will  generate  a  pole-zero  plot  of  the  data  defined  by 
the  coefficients  shown  above.  The  resulting  plot  was  included  in 
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Chapter  III. 


xmxxmmmmxxxmxxmxxmxmxxm%xm%mmmmmmxxxx 


th0=poly2th(  Cl  coeK]  .  Cl] ) ; 

Czepo,k]=th2zp(th0) ; 
zpplot(zepo) 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


This  routine  sill  transform  the  pole-zero  plot  back  into  a  vector  form. 
A  compeurison  of  the  original,  CPARCQR  zdgorithm,  and  pole-zero  plot 
coefficients  can  then  be  made. 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


th  =  ar(z,6, ’Is' , ’ppe’ ) ; 
ppoly  =  th2poly(th) ; 


X  The  result  is  the  folloeing  vector: 

X  ppoly  =  Cl. 0000  -0. 1844-0. OOlSi  -0 . 0993+0 . 0008i  -1. 2523-0. 0048i 
0.0230+0.00171  0.0922+0.00231  0.7139-0.00061] 
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Appendix  C.  CPARCOR  Algorithm  Source  Code 


C.  1  Introduction 

This  appendix  contains  a  listing  of  the  CPARCOR  algorithm  as  written  in  the  MATLAB 
programming  language.  The  following  preliminary  subroutines  were  required  to  generate  software 
compatible  test  images; 

1.  Load  and  Enhance  Subroutine:  Creates  a  smooth,  unbroken  image  boundary. 

2.  Trace  Subroutine:  Traces  the  image’s  boundary  and  calculates  its  centroid. 

3.  Sampling  Subroutine:  Provides  a  sampled  version  of  the  image’s  traced  boundary. 


C.2  CPARCOR  Algorithm 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


CPARCOR  ALGORITHM:  Using  a  ssriss  of  subroutinss,  the  CPARCOR 
coefficients  of  a  2-D  image  are  calculated  for  a  model  of  order  'm' 


written  by:  David  E.  Chelen 
date:  JULY  93 


y;m/mxxxyx/x/.xxxxy;/x/.xxxxxxxxxxxxxxxxxxxxxxxxxxxyr/.xxxxxxxxxy^ 

y;m.xxxxyx/x/xmxxxy//.xxxyj.xxxxxxxxxxxxxxxxxxxyxmxxxxxxxxxxx^^^ 


xxxxxxxxyx/x//mxxxxxxyx/.xxxxxxxxxxxxxxxxxxxxxxxxxxy;m^^^ 


Load  and  Enhance  Subroutine:  This  subroutine  vill  first  read  a  file 
into  MATLAB  and  then  run  an  edge  detecting  mask  over  the 
surface  to  enhance  the  image's  boundary. 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


lo2td  FILEIAME.ext  X  Load  in  image  file 

function  coeff  =  code(flip)  X  Declare  the  algorithm  a  function 

X  that  vill  return  the  variable  'vies' 


X  =  viesCl: 128, 1:128): 


X  Assign  image  to  128x128  array 


a  =  [30  60  30;  000;  -30  -50  -30]; 
exh  =  conv2(x,a); 
exv  =  conv2(x,a'); 


X  Edge  enhancing  mask 
%  Enhance  horizontal  edges 
X  Enhance  vertical  edges 
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ax  =  sqrt(axh.‘2  +  axv.*2); 


X  Combina  tba  two  anhancad  adgaa 


x=(ax>0);  y.  Thrashold  for  iaagasl,  backgrotuul=0 


y.y.y.xxy.y.%y.yx/.y.xy.%x%y.y.xxxxy.xxxy.y.y.y.xxxxxxiixmxxmxxxxy.xxmm%xxxxmx 


Traca  Subroutina:  This  subroutina  is  usad  to  trace  tha  iaaga's 
boundary.  Basad  on  tha  boundary  points  ratumad.  the  iaaga's 
cantroid  is  also  datarainad. 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


a  =  1;  X 

foundit  =0;  X 

maxB  =  128;  X 

maxn  =  128;  X 

whila  m  *=  foundit  X 

a  =  1+a;  X 

for  n  =  l:aaxn  X 

if  x(a,n)  ==  0 
olda  =  a; 
sa  =  a; 

oldn  =  n; 

sn  s  n; 

alsa  foundit  »  m;  X  Sat  FLAG  'hi*  whan 

iaaga  pixal  found 

start  =  [sa  sn] ; 

lastzaro  =  [sa  sn] ;  X  Kaap  location  of  last 

background  pixal 

stop  =  [a  n] ; 

fir stone  =  [a  n] ;  X  Keep  location  of  first 

iaaga  pixel 

break 

end 


Index  variable  assignaent 

FLAG  to  indicate  start  of  ia2iga  pixels 

Naxiaua  'a*  for  tha  axn  aatrix 

Maxiaua  'n'  for  tha  axn  aatrix 

Scan  aatrix  until  an  iaaga  pixal  is  found 

Incraaant  aatrix  row  (a) 

Continue  until  last  coluan  (n)  is  reached 


end 

end 


t=l;  X  Index  variable  assignaent 

flag  =1;  X  FLAG  set  for  VHILE  loop 

while  flag 

if  stop  ==  start  X  Keep  going  until  return  to  stzurt  point 
break 
flag  =  0 

else  if  oldn  <  n  X  Decide  which  way  to  turn  based  on 
X  the  current  pixal  value 
oldn  =  n; 
if  x(a.n)  ==  1 

tap(t)  =  Ca+(n*i)] ; 
t=t+l; 
a  =  a-1; 
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•la«  ■  = 

and 

elaaif  oldn  >  n 
oldn  =  n; 
if  xCa.n)  ==  1 

tnpCt)  =  Da+Cn*!)] ; 
t=t+l; 

■  =  m+1; 

alae  d  =  m-1; 
and 

alaaif  oldn  <  m 
oldn  =  m; 
if  z(m,n)  ==  1 

tnp(t)  =  Cm+(n*i)] ; 
t=t+i: 
n  =  n+l; 
alaa  n  =  n-1; 
and 

alaaif  oldm  >  m 
oldn  =  m; 
if  x(a.n)  ==  1 

tmp(t)  =  Cn+(n*i)] ; 
t=t+l; 

n  =  n-l; 
alaa  n  *  n+l; 
and 

and 

atop  =  Cm  n] : 

and 

and 

tmpaiza  =  length(tmp) ;  '/,  Find  tha  langth  of  tha  tracad  boundary 

ztmp(l)  =  tmp(l);  X  Store  firat  point  aa  rafaranca 

b=l;  X  Index  variable  aaaignnent 

for  old=l ;  (tmpaize-1)  '/,  If  pravioua  and  current  pixel 

if  ztmp(b)'=tmp(old+l) ;  '/,  locationa  are  tha  same,  gat 

ztmp(b+l)  =  tmp(old+l);  '/.  rid  of  the  repeated  point 
b=b+l; 
else 
b=b: 

end 

end 

clear  total 
clear  sum 
total  =  siua(ztmp) ; 
size  =  lengthCztmp) ; 
center  =  total/size; 
tmp  =  0; 
ztmp  =  0; 


%  Re-initialize  variables 

Calculate  the  CEITROID  based  on  the 
*/,  boundary  points  returned  by  tha  trace 

y.  Re-initialize  variables 
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mmxxm%y.mmmxmxy.y.mxx%xxxxmmxmmmmxy.xxmxr.m 


Sampling  Subroutine:  This  subroutine  uses  a  modified  version  of  the 
originzQ  Trace  Subroutine  to  return  boundeo-y  points  that  are  nou 
complex  numbers  referenced  from  the  centroid.  The  fined  sampling  rate 
and  model  order  eire  user  specified. 


y/mxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


start  =  lastzero;  X  Assign  the  starting  point  to  the  same  relative 

stop  -  firstone;  X  point  on  the  object  as  sas  originally  used 

mref  -  real(center) ;  X  Create  the  modified  z-fiy  axis 
nref  =  imeig(center) ; 
t=l; 


flag  =1;  X  Set  FLAG  for  shile  loop 

while  flag 

if  stop  ==  start 
break 
flag  =  0; 

else  if  oldn  <  n  X  Decide  which  way  to  turn  based  on 

oldn  =  n;  X  current  pixel  type 

if  x(m,n)  ==  1 

tmp(t)  =  [(m-mref)+(n-nref)*i] ; 


t=t+l; 


m  s  m-1; 
else  m  =  m-fl; 
end 


elseif  oldn  >  n 
oldn  -  n; 
if  x(m,n)  ==  1 

tmp(t)  =  [(m-mref )+(n-nref )ei] ; 
t=t+l; 
m  =  m+1; 

else  m  =  m-1; 
end 

elseif  oldffl  <  m 
oldm  =  m; 
if  x(m,n)  ==  1 

tmp(t)  =  [(m-mref )+(n-nref )*i] ; 
t=t+l; 
n  =  n+1; 
else  n  =  n-1; 
end 


elseif  oldm  >  m 
oldm  =  m; 
if  x(m,n)  ==  1 

tmp(t)  =  [(m-mref )+(n-nref)*i] ; 
t=t+l; 

n  =  n-1; 
else  n  =  n+1; 
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stop  s  [a  u] : 


snd 

snd 


snd 

snd 


tapsizs  =  Isngtli(tap) :  X  Go  through  and  sliainats  rspsatsd  points 

ztap(l)  =  tap(l); 

b=l; 

lor  old=l : (ta^sizs-1) 

il  ztBp(b)'stap(old-M)  : 
ztBp(b-*-l)  =  tap(old-<-l): 
b=b+l; 
slss 
b=b: 
snd 

snd 


I  -  160; 
ordsrsSO ; 

nna  =  IsngthCztap) ; 
intsrval  =  (nua/I); 
il  intsrval  ==  0 
intsrv2d  =  1; 


X  Oslins  ths  dssirsd  nnabsr  ol  saaplss 
X  Sat  ths  ordsr  ol  aodsl  to  bs  gsnsratsd 
X  Sizs  ol  tsap  z  vsctor 
X  Sst  saapls  intsrval 


snd 

z  -  Czti^(l:intsrval:nna)] :  X  Put  saaplsd  valnss  into  tsst  '^sctor 
il  lsngth(z)<l 
|slsngth(z) : 
snd 


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

CPARCOR  Routins:  This  routins  crsatss  ths  CPARCOR  cosllicisnts,  lor  ths 
nssr  spscilisd  aodsl  ordsr,  bassd  on  ths  saaplsd  iaags  vsctor,  Z. 
Although  only  thrss  linss  long  in  original  lora,  ths  NATLAB  vsrsion  ol 
this  algoritha  is  signilicantly  longsr. 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


k  =  0;  X  Initializs  to  zsro  lor  propsr  indsxing 

sua  s  0;  X  Initializs  to  zsro  lor  propsr  indsxing 

X************This  vill  lind  Ro****sssss**sssss 

lor  j=0:(I-i) 

L  =  z(j+l)*conj(z((j+l)-k)); 
sua  =  sua  L; 
and 

Ro  s  (l/l)ssua; 
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X************Thi8  gets  all  the  other  R'see********** 

R  =  0; 

lor  k  s  1; order 
avm  »  0; 

lor  j=0 :(■-!) 

il  ((j+l)-k)  <=0 

L  =  z(j+l)*coiij(*(I+((j+l)-k))); 

else 

L  =  z(j+l)*coiij(z((j+l)-k)); 
end 

SUB  =  SUB  +  L; 

end 

R(k)  =  (1/I)*sub: 
end 

X***e*Thi8  gets  the  CAR  and  CPARCOR  Coellicients*********** 

a  =  0; 

a(l)  =  R(l)/Ro; 

Cpar(l)=a(l):  X  First  Pb  is  a(l) 

lor  B  =  2: order 

Rllp  =  lliplr(R(l:B-l)); 

Atr  =  a(l:B-l) . ’ ; 

Pb  *  (R(b)  -  (RllpeAtr))/(Ro-(R(l:B-l)econj(Atr)));  XCPARCOR  coelf 
Cpar(B)  =  Pb; 

a(l:B)  =  [(a(l:B-l)  -  conj((lliplr(a(l:B-l))))«PB)  Pb];  XCAR  coell 
car(l:B)=a(l:n}; 

end 

coell=Cpar;  X  CPARCOR  coellicients  returned  by  the  lunction  call 
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Appendix  D.  Template  Matching  Algorithm 


D.l  Introduction 

This  appendix  contains  a  listing  of  the  Template  Matching  algorithm  as  written  in  the  MAT- 
LAB  programming  language.  Classification  is  based  on  a  Euclidean  distance  metric  defined  as 
follows: 


D{x)  =  min  II X  —  a,- 1|  (D-l) 

where  ||  ■  ||  defines  the  Euclidean  norm  for  the  expression,  and  D{x)  is  the  smallest  of  all  distances 
between  the  unknown  input-view  (x)  and  the  tth  stored  template-view  (sj)[18].  In  other  words, 
classification  is  based  upon  the  minimum  Euclidean  distance  between  a  known  template-view  and 
the  unknown  input-view.  Templates  were  based  on  m  x  n  matrices  that  contained  unnormalized 
CPARCOR  coefficient  vectors  representing  the  characteristic,  views  for  various  allowable  object 
orientations.  Individual  views  are  labeled  with  the  following  format; 


class(azimuth)4clevation)(  1  :model  jorder) 


where  individual  classes  are  assigned  to  class  designators  a  thru  e.  For  example,  the  M60 
Tank  was  assigned  to  the  designator  a,  while  the  BTR  A  PC  was  labeled  c.  A  brief  description  of 
each  template  follows: 

•  360**  View  Template  (3x3  Matrix):  Contains  the  minimum  number  of  characteristic 
views  (CPARCOR  coefficients)  required  for  total-range  360°  azimuth  and  90°  elevation  image 
recognition. 

•  180®  View  Template  (2  x  3,5  x  9,10  x  19  Matrix);  Representing  the  same  180°  view, 
the  three  matrices  contain  different  numbers  of  characteristic  views.  For  example,  the  2x3 
matrix  contains  six  characteristic  views,  while  the  10  x  19  contains  190. 

•  90®  View  Template  (2  x  2, 5  x  5, 10  x  10, 19  x  19  Matrix):  Although  representative  of 
the  same  90°  view,  the  2x2  matrix  does  it  with  only  four  characteristic  views,  while  the 
19  X  19  uses  361.  Note  that  the  19  x  19  contains  the  maximum  number  of  characteristic 
views  possible  for  the  given  range.  Thus,  the  19  x  19  matrix  was  not  used  for  unstored  view 
recognition  tests. 
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D.  2  Matrix  Source  Code 


The  following  source  code  creates  the  template  for  application  to  the  Template  Matching 
algorithm.  Examples  of  the  actual  test  matricies  (described  above)  are  also  included. 


xxmmxxmxxxxmxxxmxxmxxxxmxxmxxmxxxxmmxxmxxxxm 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

MATRIX  GEIERATIOI:  Different  Matricies  for  Different  Teaplates 
written  by:  David  E.  Chelen 
date:  AUG  93 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

X  Enter  all  initial  test  paraaeters  in  this  section 

order  =  IX  X  EITER  order  of  test  CPARCOR  aodel 

nsize  =  XX;  X  EITER  *n'  of  the  un  aatriz 

■size  =  XX;  X  EITER  the  *■*  of  the  azn  matrix 

nunclasses  =  XX;  X  EITER  the  number  of  classes  being  tested 

frac  s  order  X  Variables  assigned  to  represent  model  order 
fsfrac 

■stop  =  (msizeenumclasses)-*-!;  X  Identify  the  final  matrix  row 
elements=orderensize ;  X  Size  of  matrix  rows 

fl=(-9)*ones(l, order) ;  X  Set  matrix  boarder  elements  to  a  known  value 

botbsC(-9)*ones(l, elements)] ;  X  Boarder  to  ‘pad*  end  of  matrix 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

EXAMPLE  of  360  degree  template,  3x3  matrix  per  class  (6  classes) 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

rl  =  [a31S_0(i:f)  a0_0(l:f)  a45_0(l:f)];  X  M60  views 

r2  =  Ca270_0(l:f)  a0_90(l:f)  a90_0(l:f)]; 

r3  =  Ca226_0(l:f)  al80_0(l;f)  al36_0(l:f )] ; 

r4  =  [b316_0(l:f)  b0_0(l:f)  b46_0(l:f)];  X  M35  views 

r6  =  [b270_0(l:f)  b0_90(l:f)  b90_0(l:f)3; 

r6  =  [b225_0(l:f)  bl80_0(l:f)  bl35.0(l:f )] ; 

r7  =  Cc316_0(l:f)  c0_0(l:f)  c46_0(l:f)3;  X  BTR  views 

r8  =  Cc270_0(l:f)  c0.90(l:f)  c90_0(l:f)]; 

r9  =  [c226_0(i:f)  cl80_0(l:f)  cl35_0(l:f)] ; 

no  =  Cd3i5_0(l:f)  d0_0(l;f)  d46_0(l:f)];  X  T62  views 
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rll  »  Cd270_0(l:l)  d0_90(l:f)  d»0_0(i:f)]; 

rl2  *  Cd22S_0(l:l)  dl80_0(l:l)  dl36_0(l:f)] ; 

rl3  =  [•315_0(l:f)  aO.OCl:!)  adS.Od:!)];  X  H2  viawa 
rl4  =  [•270_0(1:«)  •0_90(l:f)  •90_0(l:f)]; 
rlS  -  [•225.0(1:1)  alSO.Od:!)  •135.0(1:1)]: 

taaplata  =  [rl:r2:r3:r4:r5;r6;r7;r8;r9;rl0;rll;rl2;rl3;rl4:rl5;botb] ; 

xxxxxxxxxxxxxxxxmxxmxxxxxxxmxxxxxxxxxxxmxxmxxxxxxxxxxxmxxx 

EXAMPLE  ol  180  dagraa  taaplata,  2x3  aatrix  par  claaa  (3  claaaas) 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

rl  =  [a0.0(l:l)  a45.0(l:l)]; 

r2  =  Ca0.90(l:l)  a90.0(l:l)]; 

r3  »  Cal80.0(l:l)  al3E.O(l:l)3; 

r4  *  [b0.0(l:l)  b45.0(l:l)]; 

rE  »  Cb0.90(l:l)  b90_0(l:l)3; 
r6  =  [bl80.0(l:l)  bl35.0(l:l)] ; 

r7  =  Cc0.0(l:l)  c45_0(l:l)]; 

r8  s  Cc0_90(l:l)  c90_0(l:l)]; 

r9  »  Ccl80_0(l:l)  cl35.0(l:l)] ; 

taaplata  «  Crl:r2:r3:r4:r5:r6;r7:r8;r9;botb] ; 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

EXAMPLE  ol  90  dagraa  taaplata,  5x5  aatrix  par  class  (3  classas) 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

rl  =  [a0.0(l:l)  al5.0(l:l)  a25.0(l:l)  a35_0(l:l)  a45.0(l:l)]; 
r2  =  Ca0.25(l:l)  al5.25(l:l)  a30.25(l:l)  a45_25(l:l)  a55.0(l:l)]; 
r3  =  [a0_50(l:l)  a25_50(l:l)  a45.50(l:l)  •60.25(1:1)  a65.0(l:l)]; 
r4  =  [a0_75(l:l)  a45_70(l:l)  a65_50(l:l)  a75.25(l:l)  a75.0(l:l)]; 
r5  *  [a0.90(l:l)  a90_75(l:l)  a90_50(l:l)  a90.25(l:l)  a90_0(l:l)]; 

r6  ®  CbO.Od:!)  bl5.0(l:l)  b25.0(l:l)  b35.0(l:l)  b45.0(l:l)]; 
r7  *  [b0.25(l:l)  bl5.25(l:l)  b30_25(l:l)  b45.25(l:l)  b55_0(l:l)]; 
r8  *  [b0.50(l:l)  b25.50(l:l)  b45.50(l:l)  b60.25(l:l)  b65.0(l:l)]; 
r9  =  [b0.75(l:l)  b45.70(l:l)  b65.50(l:l)  b75_25(l:l)  b75.0(l:l)]; 
rlO  =  [b0.90(l:l)  b90_75(l:l)  b90.50(l:l)  b90_25(l:l)  b90.0(l:l)]; 

rll  »  Cc0_0(l:l)  cl5.0(l:l)  c25_0(l:l)  c35_0(l:l)  c45.0(l:l)]; 
rl2  »  Cc0_25(l:l)  cl5_25(l:l)  c30_25(l:l)  c45.25(l:l)  c55_0(l:l)]; 
rl3  «  [c0_50(l:l)  c25_50(l:l)  c45_50(l:l)  c60_25(l:l)  c65_0(l:l)]; 
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rl4  =  CcO_76(l;f)  c4S_70(l:l)  c66_50(l;f)  c75_26(i:f)  c76_0(l:l)]; 
rl5  =  [c0_90(l:f)  c90_76(l:f)  c90_50(l:f)  c90_26(l:l)  c90_0(l:f)]: 

tmplatfl  =  Crl ;r2;r3:r4;r6:r6:r7;r8:T9:rl0;rll;rl2;rl3:rl4;rl5:botb] ; 

xmxmxxxmmmxxmmxxmxxxxxmxxmxmxmmxmxxxmxxx 

EXAMPLE  oi  90  degrae  taaplata,  10x10  matrix  par  class  (1  class) 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


rl 

= 

Ca0_0(l:f) 

aB_0(l:f) 

al0_0(l:f) 

al6_0(l:l) 

a20_0(l:f) 

a2B_0(l:f) 

a30_0(l:f) 

a3B_0(l:f) 

a40_0(l:f) 

a4B_0(l:f)]; 

r2 

= 

Ca0_10(l:f) 

a6_10(l:f) 

al0_10(l:l) 

alB_10(l:f) 

a20_10(l:f) 

a30_10(l:f) 

a3B_10(l:f) 

a40_10(l:f) 

a4B_10(l:l) 

aB0_0(l:l)]; 

r3 

= 

[a0_20(l:l) 

a8_20(l:f) 

al0_20(l:l) 

a20_20(l:f) 

a30_20(l:f) 

a3B_20(l:f) 

a40_20(l;f) 

a4B_20(l:f) 

a60_10(l:f) 

a6B_0(l:f)]; 

r4 

= 

Ca0_30(l;f) 

al0_30(l:l) 

a20_30(l:l) 

a30_30(l;l) 

a3B_30(l:f) 

a40_30(l;f) 

a4B_30(l;f) 

aB0_20(l:f) 

aBB_10{l:f) 

a60_0(l:f)] ; 

rB 

= 

Ca0_40(l:l) 

al0_40(l:f) 

a20_40(l:l) 

a30_40(l:l) 

a40_40(l:f) 

a46_40(l:f) 

aB0_30(l:f) 

aBB_20(l:l) 

a60_10(l:f) 

a6B.0(l:f)]; 

r6 

[aO_BO(l:l) 

al6_B0(l:f) 

a2B.B0(l:f) 

a3B_B0(l:f) 

a4B_B0(l:l) 

aB0_40(l:f) 

aB6_30(l:f) 

a60_20(l:f) 

a70_10(l:l) 

a70_0(l:l)]; 

r7 

= 

Ca0_60(l:f) 

alB_60(l:f) 

a30_60(l:l) 

a4B_60(l:f) 

aBB_60(l:f) 

a60.40(l;l) 

a60_30(l:f) 

a70_20(l:l) 

a7B_10(l:l) 

a7B_0(l:l)]; 

r8 

S 

Ca0_70(l:f) 

a26_70(l:f) 

a46_70(l:l) 

a60_60(l:l) 

a66_60(l:f) 

a70.40(l:l) 

a70_30(l:f) 

a80.20(l:f) 

a80_10(l:f) 

a80.0(l:l)]; 

r9 

~ 

[a0.80(l;f) 

a46_80(l:f) 

a6S_70(l:f) 

a78_60(l:f) 

a76.60(l:f) 

a80_40(l:f) 

a80_30(l:f) 

a86_20(l:l) 

a8B_10(l:f) 

a86_0(l:l)]; 

rlO 

= 

[a0_90(l:f) 

a90_80(l:f) 

a90_70(l:f) 

a90_60(l:l) 

a90_B0(l:f) 

a90_40(l:f) 

a90_30(l:f) 

a90_20(l:f) 

a90_10(l;l) 

a90_0(l:f)]; 

taaplata  =  Crl;r2;r3;r4;r6;r6;r7;r8;r9;rl0;botb] ; 


EXAMPLE  of  90  dagraa  taaplata,  19x19  matrix  par  class  (1  class) 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

al  =  Ca0_0(l:f)  a5_0(l:l)  al0_0(l:l)  al5_0(l:l)  a20_0(l:f)  a26_0(l:f) 
a30_0(l:f)  a36_0(l:f)  a40_0(l:f)  a46_0(l:l)  a60_0(l:f)  a5S_0(l:f) 
a60_0(l:f)  a66_0(l:f)  a70_0(l:f)  a76_0(l:f)  a80_0(l:f)  a86_0(l:l) 
a90_0(l:f)]; 

a2  =  Ca0_6(l:f)  a6_6(l:f)  al0_6(l:f)  al5_6(l:l)  a20_B(l:l)  a2B_6(l:l) 

a30_B(l:f)  a3B_B(l:l)  a40_B(l:f)  a4S_B(l:l)  aB0_B(l:f)  aBB_B(l:l) 
a60_B(l:f)  a6B_B(l:f)  a70_B(l:l)  a7B_B(l:f)  a80_B(l:f)  a8B_B(l:l) 
a90_B(l:f)]; 

a3  *  Ca0_10(l:l)  aB_10(l:f)  al0_10(l:f)  alB_10(l:f)  a20_10(l:f)  a2B_10(l:l) 
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a30_10(l:l)  a36_10(l;i)  a40_10(l:l)  a45_10(l:l)  a50_10(l:l)  a56_10(l:f) 
a60_10(l:i)  a66_10(l:i)  a70_10(l:l)  a76_10{l:f)  a80_10(l:f)  a86_10(l:f) 
a90_10(l:l)]; 

a4  =  Ca0_16(l:f)  a5_16(l:l)  al0_16(l:f)  al5_16(l:f)  a20_16{l:l)  a26_16(l:f) 
a30_16(l:f)  a3S_16(l:i)  a40_16(l:«)  a46_16(l:f)  a60_15(l:i)  a66_16(l:l) 
a60_15(l:l)  a65_16(l:i)  a70_16(l:f)  a75_18(l:f)  a80_15(l:l)  a85_15(l:f) 
a90_16(l:l)]; 

a6  =  [a0_20(l:f)  a5_20(l:l)  al0_20(l:<)  al6_20(l:l)  a20_20(l;f)  a26_20(l:l) 

a30_20(l;f)  a3S_20(l:f)  a^O  20(1:1)  a46_20(l:l)  aB0_20(l:l)  a65_20(l:l) 
a60_20(l:l)  a65_20(l:l)  a70_20(l:l)  a75_20(l:l)  a80_20(l:l)  a86_20(l.l) 
a90_20( 1:1)3; 

a6  =  Ca0_26(l:l)  a5_26(l;l)  al0_26(l:l)  al6_26(l:l)  a20_26(l:l)  a26_26(l:l) 
a30_26(l:l)  a36_26(l:l)  a40_28(l:l)  a48_28(l:l)  a80_28(l:l)  a68_28(l:l) 
a0_28(l;l)  a68_28(l:l)  a70_28(l:l)  a78_26(l:l)  a80_28(l:l)  a88_28(l:l) 
a90_28( 1:1)3; 

a7  =  [a0_30(l:l)  a8_30(l:l)  al0_30(l:l)  al8_30(l:l)  a20_30(l:l)  a28_30(l:l) 

a30_30(l:l)  a38_30(l:l)  a40.30(l:l)  a48_30(l:l)  a80_30(l:l)  a88_30(l:l) 
a60_30(l:l)  a68_30(l:l)  a70_30(l:l)  a78_30(l:l)  a80_30(l:l)  a88_30(l:l) 
a90_30(l:l)3; 

a8  =  Ca0_38(l:l)  a8_38(l:l)  al0_38(l:l)  al8_38(l:l)  a20_38(l:l)  a28_38(l:l) 
a30_38(l:l)  a38_38(l:l)  a40_38(l:l)  a48_38(l:l)  a80_38(l:l)  a8B_3S(l:l) 

a60_38(l:l)  a68_38(l:l)  a70_38(l:l)  a7B_38(l:l)  a80_3B(l:l)  a88_3B(l:l) 
a90_38(l:l)3 ; 

a9  =  Ca0_40(l:l)  a8_40(l:l)  al0_40(l:l)  al6_40(l:l)  a20_40(l:l)  a2B_40(l:l) 
a30_40(l:l)  a38_40(l:l)  a40_40(l:l)  a90_40(l:l)  a48_40(l:l)  a80_40(l:l) 
a86.40(l:l)  a60_40(l;l)  a66_40(l;l)  a70_40(l:l)  a7B_40(l:l)  a80_40(l:l) 
a88_40( 1:1)3; 

alO  =  [a0_48(l:l)  a8.48(l:l)  al0_48(l:l)  al8_48(l:l)  a20.48(l:l)  a28.48(l:l) 
a30_48(l:l)  a38_48(l:l)  a40_48(l:l)  a48_48(l:l)  a80_48(l:l)  a8B.48(l:l) 
a60_48(l:l)  a68_48(l:l)  a70_48(l:l)  a78_4B(l:l)  a80_48(l:l)  a88_4B(l:l) 
a90_48(l:l)3; 

all  =  Ca0_80(l:l)  a8_80(l:l)  al0_80(l:l)  al8_80(l:l)  a20_80{l:l)  a28_80(l:l) 

a30_80(l:l)  a38_80(l:l)  a40_80(l:l)  a48_B0(l:l)  a80_80(l:l)  a88_80(l:l) 
a60_80(l:l)  a68_80(l:l)  a70_80(l:l)  a78_B0(l:l)  a80_80(l:l)  a88_B0(l:l) 
a90_80(l:l)3 ; 

al2  =  [a0_66(l:l)  a8_88(l:l)  alO_8B(l:l)  al8_88(l:l)  a20_8B{l:l)  a28_88(l;l) 

a30_68(l:l)  a38_88(l:l)  a40_88(l:l)  a48_8B(l:l)  a80_66(l:l)  a88_88(l:l) 
a60_8S(l:l)  a68_88(l:l)  a70_88(l:l)  a78_88(l:l)  a80_B8(l:l)  a88_8B(l:l) 
a90_88(l:l)3; 

al3  =  Ca0_60(l:l)  a8_60(l:l)  al0_60(l:l)  al8_eo(l:l)  a20_60(l:l)  a28_60(l:l) 

a30_60(l:l)  a38_60(l:l)  a40_60(l:l)  a48_60(l:l)  aB0_60(l:l)  a88_60(l:l) 
a60_60(l:l)  a68_60(l:l)  a70_60(l:l)  a78_60(l:l)  a80_60(l:l)  a88_60(l:l) 
a90_60( 1:1)3; 

al4  =  [a0_68(l:l)  a8_68(l:l)  al0_68(l:l)  al8_88(l:l)  a20_68(l:l)  a28_68(l:l) 

a30_68(l:l)  a38_68(l:l)  a40_68(l:l)  a4S_68(l:l)  a80_68(l:l)  aB8_68(l:l) 
a60_68(l:l)  a68_6B(l:l)  a70_6B(l:l)  a78_6B(l:l)  a80_68(l:l)  a88_68(l:l) 
a90_68(l:l)3; 

al8  =  [a0_70(l:l)  a8_70(l:l)  al0_70(l:l)  al8_70(l:l)  a20_70(l:l)  a28_70(l:l) 
a30_70(l:l)  a38_70(l:l)  a40_70(l:l)  a48_70(l:l)  a80_70(l:l)  a88_70(l:l) 
a60_70(l:l)  a68_70(l:l)  a70_70(l:l)  a78_70(l:l)  a80_70(l:l)  a88_70(l:l) 
a90_70(l:l)3; 
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«16  =  Ca0_76(l:f)  a6_76(l:f)  al0_76(l:f)  al6_75(l:f)  a20_76(l:l)  a2B_76(l:l) 

a30_76(l:f)  a36_75(l:l)  a40_76(l:f)  a4S_76(l:l)  a60_76(l:l)  a65_76(l:l) 
a60_76(l:f)  a65_76(l:<)  a70_76(l:l)  a75_7B(l:f)  a80_7B(l:l)  a8B_7B{l:l) 
a90_7B(l:l)]; 

al7  =  [a0_80(l:f)  a8_80(l:l)  al0_80(l:f)  al8_80{l:l)  a20_80(l:f)  a28_80(l;l) 

a30_80(l:i)  a38_80(l:i)  a40_80(l:f)  a48_80(l:f)  a80_80(l:f)  a88_80(l:i) 
a60_80(l:l)  a68_80(l:l)  a70_80(l:l)  a7B_80{l;l)  a80_80(l;l)  a88_80(l:l) 
a90_80(l:f)] ; 

al8  =  Ca0_88(l:i)  a8_88(l:l)  al0_88(l:l)  alB_88(l:f)  a20_8B(l:l)  a28_88(l:f) 

a30_88(l:l)  a38_88(l:f)  a40_88(l:f)  a48_8B(l:l)  aB0_88(l:l)  a8B_88(l:i) 
a60_88(l:«)  a68_88(l:f)  a70_88(l:l)  a78_88(l:l)  a80_8B(l:l)  a88_88(l;l) 
a90_88(l:f)] ; 

al9  =  Ca0_90(l:f)  a8_90(l:f)  al0_90(l:f)  al8_90(l:f)  a20_90(l:f)  a28_90{l:f) 

a30_90(l:f)  a38_90(l:i)  a40_90(l:f)  a4B_90(l:l)  a80_90{l;f)  aB8_90(l:i) 
a60_90(l:f)  a68_90(l:f)  a70_90(l:f)  a78_90(l:f)  a80_90{l:t)  a88_90(l:l) 
a90_90(l:l)] ; 


plateA=Cal;a2;a3;a4:aB;a6:a7:a8;a9;al0:all;al2:al3;al4;alB;al6;al7;al8;al9;botb] ; 
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D.S  Template  Matching  Algorithm  Source  Code 

The  Template  Matching  algorithm  returns  the  matrix  location  and  corresponding  class  label 
of  the  best  match  for  a  given  unknown  input  view.  The  source  code  for  this  algorithm  follows: 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

TEMPLATE  MATCHIIG  ALGORITHM :  Clasif ication  based  on  Euclidean  distance 
written  by;  David  E.  Chelen 
date:  AUG  93 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


i=0; 

T=0; 

8tore=0: 
temp=0: 
count=0 : 
llagaO; 
lirstllagsO; 


X  Initiadize  index 
X  Initialize  index 
X  Initialize  variable 


X  FLAGS  for  event  indication 


perfects 'I  found  a  perfect  natch!!'  X  Message  for  perfect  natch 


knownvec  =  inputviev(l rfrac) ;  X  This  is  where  the  unknown  input  view 

X  is  brought  in  (CPARCOR  vector)  amd  set 
X  to  the  desired  nodel  order 


X  Extract  stored  CPARCOR  vectors  fron  tenplate  for  conparison  to  input  view 

for  nstau:t=l:mstop 
if  flag==l 

nstaLTtsnstaurt-l ; 
break 
else 

n8taurt=l; 
nstopsorder; 

for  ngo=l:n8ize 
if  flag==i 
breadc 

else 

tenpvec  =  0; 
w=i: 

for  n  s  nstartrnstop 

tenpvec (w)  s  tenplate (nstart.n);  X  Read  in  tenplate  vectors  to  conpare 


X  Index  for  natrix  rows 
X  FLAG  to  indicate  a  ‘prefect’  natch  was  found 
X  Matrix  'n'  address  of  natching  view 

X  Matrix  ‘n’  address 
X  Stop  based  on  vector  length 

X  Repeat  based  on  natrix  size 
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X  Hith  knom  input  viev 
w=w+i : 

and 

if  fir8tflag==0  X  If  a  'perfect  natch  was  not  found 

store^tenpTec ;  X  set  up  an  initied.  distance  reference 

oldgap=8UB(ab8(knoniTec-tenpvec(l:frac))) :  X  Enclideu  distance  is 
X  defined  here  as  the  sun  of  the  nagnitude  of 
X  the  difference  betueen  the  input  vie*  and 
X  the  stored  vies 

firatflag=l: 

end 

tastvecstenpvec(l:frac) ;  X  Assign  variable  nane  to  stored  tanplate  via* 

X  for  conparison  *ith  input  vie* 


if  kno*nvec==te8tvec  X  If  'perfect*  natch  is  found,  BREAK  out  of  routine 


flag=l; 

store^testvec; 
count =0; 
moldsmstart ; 
perfect 
break 

else  X  If  vie*8  do  not  natch,  find  the  next  best  thing 

ne*gap=sun(abs(kno*nvec-testvec}) ;  X  Euclidean  distance  bet*een  input 
X  and  stored  tenplate  vie*8 

if  ne*gap  <  oldgap  X  Initial  tollerance  conparrison  of  vises 

oldgap  =  ne*gap;  X  Establish  ne*  Euclidean  distance  reference 
store  =0;  X  Reset  variables  to  zero  for  next  run 

i=0; 
f=0: 

count=0 ; 

addm=0; 

addn=0; 

end 

end 

if  neegap  <=  oldgap  X  STORE  the  best  natched  vie* 

f=f-t-l;  X  Increnent  index 

addn(f )=n8tart;  X  Matrix  'n*.  'n*  address  narkers 

addn(f}=n8tart; 
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for  >si; order 


i=i+l; 

8tore(i}-tempvec(m) ; 

•nd 

count=couiit+l;  Xlimber  of  stored  Batches  that  sere  within  tollerance 

end 

nstart  =  nstop->'l;  %  Adjust  starting  point  for  next  run 

nstop  =  nstop-t-order :  %  Adjust  stopping  point  for  next  run 

end 

end 

teap^store;  Assign  STOREd  aatchCes)  to  a  TEMP  vector 

end 

end 

end 

%After  this  routine,  mstart  and  nstart  values  are  in  adda(f) 

%  and  addn(f).  Here,  'f'  should  equal  'count' 

X  CLASS  Assignaents,  based  on  aatrix  location,  are  aade  here. 

%  'XX'  is  the  first  row  of  the  next  class  (ox  the  last  row  of  each 
X  class  plus  one.  Thixik  about  it —  you'll  get  it.  I'm  outta  here. 

if  addB<  XX 
cla88=l; 
elseif  addm<  XX 
cla8s=2; 
elseif  addB<  XX 
class=3; 
elseif  adda<  XX 
cla8S=4; 
elseif  adda<  XX 
class=5; 
end 
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